PCT/US97/03315

U.S. PATENT APPLICATION



NAMI A. PACfXLA, HAWICH, MC; SMOJRA L. VQUHATH, DUftHAM, HCj KMC A. 



FOREIGN/PCT APPLICATIONS

REC'D
WIPO PCT

PH/5-20756/P

PROMOTERS FROM PLANT PROTOPORPHYRINOGEN OXIDASE GENES

This is to certify that annexed hereto is a true copy from the records of the United States




$^0003 




PROVISIONAL PATENT APPLICATION COVER SHEET 



JfecclLa 
Volr«th ~" 
Ward 



Siuir* ' 

"Erie 



Raleigh, North Carolina
-TRku, north Cirollnp k I t . - 
IjtirfeM'. Rorth CmoIIm ' < 



TITLE OF THE INVENTION

Promoters from Plant Protoporphyrinogen Oxidase Genes



PROVISIONAL FILING FEE AMOUNT ($)



I* •* (9 t9) 5*1-8614 
Date: June 21, 1996



/name • Ju*« Scott Umt' 

ATTORNEY/AGENT
REGISTRATION NO. 36,129







VQ20003 

PROMOTERS FROM PLANT PROTOPORPHYRINOGEN OXIDASE GENES



CROSS-REFERENCE TO RELATED PROVISIONAL 
This provisional application is related to U.S. provisional application serial no.
«WlJ.612fikd February 28. 1996. 



FIELD OF THE INVENTION 
This invention relates to novel DNA sequences which function as promoters of
transcription of associated DNA sequences in plants. More specifically, this invention relates to
novel promoters which are naturally associated with plant protoporphyrinogen oxidase (protox)
coding sequences.



BACKGROUND OF THE INVENTION 
I. The Protox Enzyme and Its Involvement in the Chlorophyll/Heme Biosynthetic
Pathway

The biosynthetic pathways which lead to the production of chlorophyll and heme share a
number of common steps. Chlorophyll is a light harvesting pigment present in all green
photosynthetic organisms. Heme is a cofactor of hemoglobin, cytochromes, P450 mixed-function
oxygenases, peroxidases, and catalases (see, e.g. Lehninger, Biochemistry, Worth Publishers,
New York (1975)), and is therefore a necessary component for all aerobic organisms.

The last common step in chlorophyll and heme biosynthesis is the oxidation of
protoporphyrinogen IX to protoporphyrin IX. Protoporphyrinogen oxidase (referred to herein as
"protox") is the enzyme which catalyzes this last oxidation step (Matringe et al., Biochem. J. 260:
231 (1989)).

The protox enzyme has been purified either partially or completely from a number of
organisms including the yeast Saccharomyces cerevisiae (Labbe-Bois and Labbe, In Biosynthesis
of Heme and Chlorophyll, E.M. Dailey, ed. McGraw Hill: New York, pp. 235-285 (1990)), barley
etioplasts (Jacobs and Jacobs, Biochem. J. 244: 219 (1987)), and mouse liver (Dailey and Karr,
Biochem. 26: 2697 (1987)). Genes encoding protox have been isolated from two prokaryotic
organisms, Escherichia coli (Sasarman et al., Can. J. Microbiol. 39: 1155 (1993)) and Bacillus
subtilis (Dailey et al., J. Biol. Chem. 269: 813 (1994)). These genes share no sequence similarity;
neither do their predicted protein products share any amino acid sequence identity. The E. coli
protein is approximately 21 kDa, and associates with the cell membrane. The B. subtilis protein is
51 kDa, and is a soluble, cytoplasmic activity.

Protox encoding cDNAs have now also been isolated from humans (see Nishimura et al.,
J. Biol. Chem. 270(14): 8076-8080 (1995)) and plants (International application no.
PCT/IB95/00452 filed June 8, 1995, published Dec. 21, 1995 as WO 95/34659).



II. The Protox Gene as a Herbicide Target

The use of herbicides to control undesirable vegetation such as weeds or plants in crops
has become almost a universal practice. The relevant market exceeds a billion dollars annually.
Despite this extensive use, weed control remains a significant and costly problem for farmers.

Effective use of herbicides requires sound management. For instance, time and method of
application and stage of weed plant development are critical to getting good weed control with
herbicides. Since various weed species are resistant to herbicides, the production of effective
herbicides becomes increasingly important.

Unfortunately, herbicides that exhibit greater potency, broader weed spectrum and more
rapid degradation in soil can also have greater crop phytotoxicity. One solution applied to this
problem has been to develop crops which are resistant or tolerant to herbicides. Crop hybrids or
varieties resistant to the herbicides allow for the use of the herbicides without attendant risk of
damage to the crop. Development of resistance can allow application of a herbicide to a crop
where its use was previously precluded or limited (e.g. to pre-emergence use) due to sensitivity
of the crop to the herbicide. For example, U.S. Patent No. 4,761,373 to Anderson et al. is
directed to plants resistant to various imidazolinone or sulfonamide herbicides. The resistance is
conferred by an altered acetohydroxyacid synthase (AHAS) enzyme. U.S. Patent No. 4,975,374
to Goodman et al. relates to plant cells and plants containing a gene encoding a mutant glutamine
synthetase (GS) resistant to inhibition by herbicides that were known to inhibit GS, e.g.
phosphinothricin and methionine sulfoximine. U.S. Patent No. 5,013,659 to Bedbrook et al. is
directed to plants that express a mutant acetolactate synthase which renders the plants resistant to
inhibition by sulfonylurea herbicides. U.S. Patent No. 5,162,602 to Somers et al. discloses plants
tolerant to inhibition by cyclohexanedione and aryloxyphenoxypropanoic acid herbicides. The
tolerance is conferred by an altered acetyl coenzyme A carboxylase (ACCase).

The protox enzyme serves as the target for a variety of herbicidal compounds. The
herbicides that inhibit protox include many different structural classes of molecules (Duke et al.,
Weed Sci. 39: 465 (1991); Nandihalli et al., Pesticide Biochem. Physiol. 43: 193 (1992);
Matringe et al., FEBS Lett. 245: 35 (1989); Yanase and Andoh, Pesticide Biochem. Physiol. 35:
70 (1989)). These herbicidal compounds include the diphenylethers (e.g. acifluorfen, 5-[2-
chloro-4-(trifluoromethyl)phenoxy]-2-nitrobenzoic acid; its methyl ester; or oxyfluorfen, 2-chloro-
1-(3-ethoxy-4-nitrophenoxy)-4-(trifluorobenzene)), oxidiazoles (e.g. oxidiazon, 3-[2,4-dichloro-
5-(1-methylethoxy)phenyl]-5-(1,1-dimethylethyl)-1,3,4-oxadiazol-2-(3H)-one), cyclic imides (e.g.
S-23142, N-(4-chloro-2-fluoro-5-propargyloxyphenyl)-3,4,5,6-tetrahydrophthalimide;
chlorophthalim, N-(4-chlorophenyl)-3,4,5,6-tetrahydrophthalimide), phenyl pyrazoles (e.g. TNPP-
ethyl, ethyl 2-[1-(2,3,4-trichlorophenyl)-4-nitropyrazolyl-5-oxy]propionate; M&B 39279),
pyridine derivatives (e.g. LS 82-556), and phenopylate and its O-phenylpyrrolidino- and
piperidinocarbamate analogs. Many of these compounds competitively inhibit the normal reaction
catalyzed by the enzyme, apparently acting as substrate analogs.

Typically, the inhibitory effect on protox is determined by measuring fluorescence at about
622 to 635 nm, after excitation at about 395 to 410 nm (see, e.g. Jacobs and Jacobs, Enzyme 28:
206 (1982); Sherman et al., Plant Physiol. 97: 280 (1991)). This assay is based on the fact that
protoporphyrin IX is a fluorescent pigment, and protoporphyrinogen IX is nonfluorescent.
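
By way of illustration only, the readout of such a fluorescence assay can be reduced to a percent
inhibition value. The following Python sketch is not part of the cited assay protocols; the function
name, the blank-subtraction step and all numeric readings are assumptions chosen for the example.

```python
def percent_inhibition(control_signal, treated_signal, blank_signal=0.0):
    """Percent inhibition of protox activity from fluorescence readings.

    Fluorescence reports the product (protoporphyrin IX), so a lower signal
    in the presence of inhibitor means stronger inhibition. Blank subtraction
    is an assumption of this sketch, not a step stated in the text.
    """
    control = control_signal - blank_signal
    treated = treated_signal - blank_signal
    if control <= 0:
        raise ValueError("control signal must exceed the blank")
    return 100.0 * (1.0 - treated / control)


# Hypothetical readings (arbitrary fluorescence units) keyed by inhibitor
# concentration in nM; these are illustrative numbers, not measured data.
readings = {0.0: 1000.0, 10.0: 550.0, 100.0: 100.0}
blank = 50.0

for concentration, signal in readings.items():
    value = percent_inhibition(readings[0.0], signal, blank)
    print(f"{concentration:6.1f} nM inhibitor: {value:5.1f} % inhibition")
```

A dose-response curve built from such values could then be used to compare inhibitor potency
between enzyme forms.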

The predicted mode of action of protox-inhibiting herbicides involves the accumulation of
protoporphyrinogen IX in the chloroplast. This accumulation is thought to lead to leakage of
protoporphyrinogen IX into the cytosol where it is oxidized by a peroxidase activity to
protoporphyrin IX. When exposed to light, protoporphyrin IX can cause formation of singlet
oxygen in the cytosol. This singlet oxygen can in turn lead to the formation of other reactive
oxygen species, which can cause lipid peroxidation and membrane disruption leading to rapid cell
death (Lee et al., Plant Physiol. 102: 881 (1993)).

Not all protox enzymes are sensitive to herbicides which inhibit plant protox enzymes.
Both of the protox enzymes encoded by genes isolated from Escherichia coli (Sasarman et al.,
Can. J. Microbiol. 39: 1155 (1993)) and Bacillus subtilis (Dailey et al., J. Biol. Chem. 269: 813
(1994)) are resistant to these herbicidal inhibitors. In addition, mutants of the unicellular alga
Chlamydomonas reinhardtii resistant to the phenylimide herbicide S-23142 have been reported
(Kataoka et al., J. Pesticide Sci. 15: 449 (1990); Shibata et al., In Research in Photosynthesis,
Vol. III, N. Murata, ed. Kluwer: Netherlands, pp. 567-570 (1992)). At least one of these mutants
appears to have an altered protox activity that is resistant not only to the herbicidal inhibitor on
which the mutant was selected, but also to other classes of protox inhibitors (Oshio et al., Z.
Naturforsch. 48c: 339 (1993); Sato et al., In ACS Symposium on Porphyric Pesticides, S. Duke,
ed. ACS Press: Washington, D.C. (1994)). A mutant tobacco cell line has also been reported that
is resistant to the inhibitor S-21432 (Che et al., Z. Naturforsch. 48c: 350 (1993)). In addition,
modified, inhibitor-resistant forms of plant protox coding sequences have been described in
international application no. PCT/IB95/00452 filed June 8, 1995, published Dec. 21, 1995 as WO
95/34659.



III. Regulation of Protox Gene Expression

The bulk of the research related to the protox gene which has been conducted thus far
has focused upon the coding sequence and modifications to this enzyme which may render it
resistant to protox inhibitors. No information is available in the art with regard to the regulatory
elements which control and promote the expression of protox coding sequences in plants.



SUMMARY OF THE INVENTION 

The present invention is based on the discovery that the promoter regions naturally
associated with the plant protoporphyrinogen oxidase (protox) coding sequences, referred to
herein generally as the "protox promoter", are useful for promoting expression of a heterologous
coding sequence in a plant.

In accordance with this discovery, the present invention provides an isolated DNA
molecule comprising a plant protox promoter. The present invention further provides a chimeric
gene comprising a plant protox promoter operably linked to a heterologous coding sequence.
Plant tissue and plants containing such a chimeric gene are also provided.

In one aspect of the invention the protox promoter is used to express herbicide resistant
forms of herbicide target proteins in a plant to confer tolerance to the herbicide. According to
this aspect, the protox promoter may be operably linked to a coding sequence for a herbicide-
resistant plant protox protein which is resistant to inhibitors of unmodified plant protox protein.

SEQ ID No. 1: DNA coding sequence for an Arabidopsis thaliana protox-1 protein.

SEQ ID No. 2: Arabidopsis thaliana protox-1 amino acid sequence encoded by SEQ ID No. 1.

SEQ ID No. 3: DNA coding sequence for an Arabidopsis thaliana protox-2 protein.

SEQ ID No. 4: Arabidopsis thaliana protox-2 amino acid sequence encoded by SEQ ID No. 3.

SEQ ID No. 5: DNA coding sequence for a maize protox-1 protein.

SEQ ID No. 6: Maize protox-1 amino acid sequence encoded by SEQ ID No. 5.

SEQ ID No. 7: DNA coding sequence for a maize protox-2 protein.

SEQ ID No. 8: Maize protox-2 amino acid sequence encoded by SEQ ID No. 7.

SEQ ID No. 9: DNA coding sequence for a wheat protox-1 protein.

SEQ ID No. 10: Wheat protox-1 amino acid sequence encoded by SEQ ID No. 9.

SEQ ID No. 11: DNA coding sequence for a soybean protox-1 protein.

SEQ ID No. 12: Soybean protox-1 protein encoded by SEQ ID No. 11.

SEQ ID No. 13: Promoter sequence from Arabidopsis thaliana protox-1 gene.

SEQ ID No. 14: Promoter sequence from Zea mays (maize) protox-1 gene.



DEFINITIONS 

As used herein a "plant protox promoter" is used to refer to the regulatory region which
naturally occurs immediately upstream of a protoporphyrinogen oxidase (protox) coding sequence
in a plant and is responsible, in its naturally occurring state, for regulating the transcription of the
associated protox coding sequence. The plant protox promoter includes the DNA region directly
involved in binding of RNA polymerase to initiate transcription and additional upstream
regulatory cis-elements which influence the transcription of an operably linked coding sequence.

As used herein a "gene" is used to refer to a DNA molecule which includes (1) a coding
sequence and (2) associated regulatory regions which promote and regulate the transcription of
the coding sequence in a suitable host cell. The coding sequence may encode a useful transcript
(e.g. antisense RNA) or polypeptide produced by translation of the encoded transcript. A gene
includes at a minimum, in 5'-3' orientation, a promoter region, a coding sequence and a
transcription terminator. A gene may also include additional regulatory regions which can occur
as part of the minimal elements (e.g. leaders or signal peptides within the coding sequence) or as
discrete elements (e.g. introns).

As used herein a "chimeric gene" refers to a gene which does not naturally occur
wherein at least one component part is heterologous with respect to another component part. As
used herein to describe the present invention a "chimeric gene" refers to a gene which includes the
promoter of the invention operably linked to a heterologous coding sequence.

As used herein with reference to the relationship between a promoter and a coding
sequence, the term "heterologous" is used to refer to a relationship which does not naturally
occur. For instance, a coding sequence is considered heterologous with respect to a promoter
sequence if it is different from the coding sequence that naturally occurs in association with the
promoter sequence. This includes modified forms of coding sequences which are naturally
associated with a subject promoter. Accordingly, a modified, inhibitor-resistant protox coding
sequence is considered to be heterologous with respect to the promoter that is naturally
associated with the unmodified, inhibitor-sensitive form of this coding sequence.



As used herein, the term "substantial sequence homology" is used to indicate that a
nucleotide sequence (in the case of DNA or RNA) or an amino acid sequence (in the case of a
protein or polypeptide) exhibits substantial structural and functional equivalence with another
nucleotide or amino acid sequence. Any functional or structural differences between sequences
having substantial sequence homology will be de minimis; that is, they will not affect the ability of
the sequence to function as indicated in the present application. For example, a sequence which
has substantial sequence homology with a DNA sequence disclosed to be a plant protox promoter
will be able to direct the same level and pattern of expression of an associated DNA sequence as
the plant protox promoter. Sequences that have substantial sequence homology with the
sequences disclosed herein are usually variants of the disclosed sequence, such as mutations, but
may also be synthetic sequences. Structural differences are considered de minimis if there is a
significant amount of sequence overlap or similarity between two or more different sequences or
if the different sequences exhibit similar physical characteristics. Such characteristics can include,
for example, immunological reactivity, enzyme activity, structural protein integrity, etc.

Two nucleotide sequences may have substantial sequence homology if the sequences
have at least 70 percent, more preferably 80 percent and most preferably 90 percent sequence
similarity between them. Two amino acid sequences have substantial sequence homology if they
have at least 50 percent, preferably 70 percent, and most preferably 90 percent similarity between
the active portions of the polypeptides. In the case of promoter DNA sequences, "substantial
sequence homology" also refers to those fragments of a promoter DNA sequence that are able to
operate to promote the expression of associated DNA sequences. Such operable fragments of a
promoter DNA sequence may be derived from the promoter DNA sequence, for example, by
cleaving the promoter DNA sequence using restriction enzymes, synthesizing in accordance with
the sequence of the promoter DNA sequence, or may be obtained through the use of PCR
technology. Mullis et al., Meth. Enzymol. 155: 335-350 (1987); Erlich (ed.), PCR Technology,
Stockton Press (New York 1989).
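
By way of illustration only, the nucleotide percentages recited above can be applied to a pair of
sequences that have already been aligned. The following Python sketch is an assumption-laden
example, not a method prescribed by this application: it uses simple percent identity over the
alignment length as a stand-in for "sequence similarity", and the function names and tier labels are
hypothetical.

```python
# Thresholds taken from the nucleotide percentages recited in the text;
# the labels attached to them are hypothetical names for this sketch.
NUCLEOTIDE_TIERS = [
    (90.0, "most preferred"),
    (80.0, "more preferred"),
    (70.0, "substantial"),
]


def percent_identity(aligned_a, aligned_b):
    """Percent of alignment columns holding the same residue in both sequences.

    Both inputs must already be aligned and of equal length; '-' marks a gap
    and never counts as a match.
    """
    if not aligned_a or len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same non-zero length")
    pairs = zip(aligned_a.upper(), aligned_b.upper())
    matches = sum(1 for a, b in pairs if a == b and a != "-")
    return 100.0 * matches / len(aligned_a)


def homology_tier(identity, tiers=NUCLEOTIDE_TIERS):
    """Return the label of the highest threshold met, or 'below threshold'."""
    for threshold, label in tiers:
        if identity >= threshold:
            return label
    return "below threshold"


# Toy sequences invented for the example (8 of 10 positions identical).
identity = percent_identity("ACGTACGTAC", "ACGTACGTTT")
print(identity, homology_tier(identity))
```

For amino acid sequences the same helper could be called with a tier list built from the 50, 70
and 90 percent values, although a similarity score based on a substitution matrix would be closer
to the term used in the text.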

A promoter DNA sequence is said to be "operably linked" to a second DNA sequence if
the two are situated such that the promoter DNA sequence influences the transcription or
translation of the second DNA sequence. For example, if the second DNA sequence codes for
the production of a protein, the promoter DNA sequence would be operably linked to the second
DNA sequence if the promoter DNA sequence affects the expression of the protein product from
the second DNA sequence. For example, in a DNA sequence comprising a promoter DNA
sequence physically attached to a coding DNA sequence in the same chimeric construct, the two
sequences are likely to be operably linked.

DETAILED DESCRIPTION OF THE INVENTION

The present invention relates to promoter DNA sequences which are naturally
associated with coding sequences for plant protoporphyrinogen oxidase (referred to herein as
"protox"; see international application no. PCT/IB95/00452 filed June 8, 1995, published Dec. 21,
1995 as WO 95/34659 and co-pending provisional application entitled "DNA Molecules
Encoding Plant Protoporphyrinogen Oxidase and Inhibitor Resistant Mutants Thereof" filed on
the same day as the instant application). These protox promoter sequences have been found to be
useful for the expression of a heterologous coding sequence in a plant.

The promoter sequence for the Arabidopsis thaliana protox-1 coding sequence (SEQ
ID No. 1) is provided as SEQ ID No. 13. Isolation of this promoter from a genomic library using
the associated coding sequence as a probe is described in Example 1. The promoter sequence for
the maize protox-1 coding sequence (SEQ ID No. 5) is provided as SEQ ID No. 14. Isolation of
this promoter from a genomic library using the associated coding sequence as a probe is described
in Example 4.

The approach used to isolate the Arabidopsis and maize protox-1 promoters can be
used to isolate the promoter sequence from any plant protox gene. Any protox coding sequence
which shares sufficient homology to hybridize to the protox coding sequence associated with the
promoter of interest may be used as a probe in this approach. Since the respective protox-1 and
protox-2 coding sequences from all plants are contemplated to share this requisite degree of
homology, the choice of which protox coding sequence is used as a probe is not considered
critical. However, for optimal hybridization results it is preferable to use the most closely related
protox coding sequence. Most preferably, the coding sequence used as a probe is from the same
plant species as the protox promoter of interest and is the coding sequence naturally associated
with the promoter.

The plant protox promoter of the present invention includes the Arabidopsis protox-1
promoter sequence set forth in SEQ ID No. 13 as well as corresponding protox-1 promoter
sequences available from other plant species as indicated above. The present invention also
includes functional fragments of these DNA sequences which retain the ability to regulate
expression of an operably linked coding sequence in the same manner as the exemplified protox
promoter sequence. Such functional fragments may be identified through deletion analyses or
other standard techniques used in the art to identify protox promoter activity (see, e.g. pages 546-
549 of "Genes IV", ed. by Lewin, Oxford Univ. Press (1990)). The present invention also
includes DNA sequences having substantial sequence homology with the protox promoters
available from plant genes which confer an equivalent level and pattern of expression upon an
operably linked sequence. Such promoter sequences may be obtained through modification of the
protox promoters isolated from plant genes and are considered functionally equivalent derivatives
of the plant protox promoters.

As illustrated in the examples below, the DNA sequences, vectors and transgenic plants
of the present invention comprise a promoter sequence derived from a plant protox gene. The
protox promoter DNA sequences are preferably linked operably to a coding DNA sequence, for
example a DNA sequence which is transcribed into a useful RNA transcript such as an antisense
transcript, or a coding sequence which is ultimately expressed in the production of a useful
protein product.

In a preferred embodiment, the protox promoter is used to direct the expression of a
modified herbicide target enzyme which is resistant to herbicides at levels that inhibit the
corresponding unmodified version of the enzyme. Such modified herbicide-resistant enzymes
include herbicide-resistant forms of imidazoleglycerol phosphate dehydratase (IGPD; see WO
9426909 published Nov. 24, 1994), EPSP synthase (see U.S. Pat. Nos. 4,535,060; 4,769,061;
4,940,835 and EP 550,633), glutamine synthetase (GS; see U.S. Patent No. 4,975,374), acetyl
coenzyme A carboxylase (ACCase; see U.S. Patent No. 5,162,602), and acetolactate synthase
(see U.S. Patent Nos. 4,761,373; 5,304,732; 5,331,107; 5,013,659; 5,141,870; and 5,378,824).
In a most preferred embodiment, the protox promoter is used to direct the expression of a
modified protox enzyme which is resistant to protox inhibitors as illustrated in Examples 2-3 (see
also International application no. PCT/IB95/00452 filed June 8, 1995, published Dec. 21, 1995 as
WO 95/34659 whose relevant parts are herein incorporated by reference; see also co-pending
application entitled "DNA Molecules Encoding Plant Protoporphyrinogen Oxidase and Inhibitor
Resistant Mutants Thereof" filed on the same day as the instant application).

The transgenic plants of the present invention may be transformed by any method of
transformation known in the art. These methods include, for instance, transformation by direct
infection or co-cultivation of plants, plant tissue or cells with Agrobacterium tumefaciens; Horsch
et al., Science, 227: 1229 (1985); Marton, "Cell Culture and Somatic Cell Genetic of Plants", vol.
1, pp. 514-521 (1984); direct gene transfer into protoplasts; Paszkowski et al., EMBO J. 3: 2717
(1984); Loerz et al., Mol. Gen. & Genet. 199: 178 (1985); Fromm et al., Nature 319: 791 (1986);
microprojectile bombardment, Klein et al., Bio/Technology, 6: 559-563 (1988); injection into
protoplasts, cultured cells and tissues, Reich et al., Bio/Technology, 4: 1001-1004 (1986); or
injection into meristematic tissues of seedlings and plants as described by De La Pena et al.,
Nature, 325: 274-276 (1987); Hooykaas-Van Slogteren et al., Nature, 311: 763-764 (1984);
Grimsley et al., Bio/Technology, 6: 185 (1988); and Grimsley et al., Nature, 325: 177 (1987).

The invention is illustrated in more detail by the following examples, without implying
any restriction to what is described therein.

EXAMPLES

EXAMPLE 1: Isolation of the Arabidopsis thaliana Protox-1 promoter sequence

A Lambda Zap II genomic DNA library prepared from Arabidopsis thaliana (Columbia,
whole plant) was purchased from Stratagene. Approximately 125,000 phage were plated at a
density of 25,000 pfu (plaque forming units) per 15 cm Petri dish and duplicate lifts were made
onto Colony/Plaque Screen membranes (NEN Dupont). The plaque lifts were probed with the
Arabidopsis Protox-1 cDNA (SEQ ID No. 1) labeled with 32P-dCTP by the random priming
method (Life Technologies). Hybridization and wash conditions were at 65° C as described in
Church and Gilbert, Proc. Natl. Acad. Sci. USA 81: 1991-1995 (1984). Positively hybridizing
plaques were purified and in vivo excised into pBluescript plasmids. Sequence from the genomic
DNA inserts was determined by the chain termination method using dideoxy terminators labeled
with fluorescent dyes (Applied Biosystems, Inc.). One clone, AraPT1Pro, was determined to
contain 580 bp of Arabidopsis sequence upstream from the initiating methionine (ATG) of the
Protox-1 protein coding sequence. This clone also contains coding sequence and introns that
extend to bp 1241 of the Protox-1 cDNA sequence. The 580 bp 5' noncoding fragment is the
putative Arabidopsis Protox-1 promoter, and the sequence is set forth in SEQ ID No. 13.
AraPT1Pro was deposited December 14, 1995, as pWDC-11 (NRRL #B-21515).
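
By way of illustration only, once the position of the initiating ATG within a sequenced genomic
clone is known, the 5' noncoding region can be taken as the bases immediately upstream of it.
The following Python sketch assumes a hypothetical helper name and a short invented sequence;
it does not reproduce SEQ ID No. 13 or the actual AraPT1Pro insert.

```python
def upstream_region(genomic_seq, atg_start, length):
    """Return up to `length` bases immediately 5' of a start codon.

    `atg_start` is the 0-based index of the 'A' of the initiating ATG.
    If fewer than `length` bases lie upstream, all of them are returned.
    """
    if genomic_seq[atg_start:atg_start + 3].upper() != "ATG":
        raise ValueError("no ATG at the given position")
    begin = max(0, atg_start - length)
    return genomic_seq[begin:atg_start]


# Toy clone invented for the example: 18 bases of 5' sequence followed by
# the start of a coding region. The real clone carries 580 bp upstream.
toy_upstream = "GGCTATAAATAGCCAACC"
toy_coding = "ATGGAGTTATCT"
toy_clone = toy_upstream + toy_coding

# Ask for 580 bp, as in the example; the toy clone only has 18 available.
putative_promoter = upstream_region(toy_clone, len(toy_upstream), 580)
print(len(putative_promoter), putative_promoter)
```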

EXAMPLE 2: Construction of plant transformation vectors expressing altered
Protox-1 genes behind the native Arabidopsis Protox-1 promoter

A full-length cDNA of the appropriate altered Arabidopsis Protox-1 cDNA is isolated as
an EcoRI-XhoI partial digest fragment and cloned into the plant expression vector
pCGN1761ENX (see Example 9 of International application no. PCT/IB95/00452 filed June 8,
1995, published Dec. 21, 1995 as WO 95/34659). This plasmid is digested with NcoI and BamHI
to produce a fragment comprised of the complete Protox-1 cDNA plus a transcription terminator
from the 3' untranslated sequence of the tml gene of Agrobacterium tumefaciens. The
AraPT1Pro plasmid described above is digested with NcoI and BamHI to produce a fragment
comprised of pBluescript and the 580 bp putative Arabidopsis Protox-1 promoter. Ligation of
these two fragments produces a fusion of the altered protox cDNA to the native protox promoter.
The expression cassette containing the Protox-1 promoter/Protox-1 cDNA/tml terminator fusion
is excised by digestion with KpnI and cloned into the binary vector pCIB200. The binary plasmid
is transformed by electroporation into Agrobacterium and then into Arabidopsis using the vacuum
infiltration method (Bechtold et al., C.R. Acad. Sci. Paris 316: 1194-1199 (1993)).
Transformants expressing altered protox genes are selected on kanamycin or on various
concentrations of protox inhibiting herbicide.
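
By way of illustration only, the outcome of a double digest such as the NcoI/BamHI digest
described above can be previewed in silico. The following Python sketch is a simplification under
stated assumptions: it treats the molecule as linear (the plasmids in this example are circular),
considers only the top strand, and uses an invented toy sequence. The recognition sequences and
top-strand cut offsets are the commonly published ones for these enzymes; the function names
are hypothetical.

```python
# Recognition site and top-strand cut offset (bases from the site start).
# NcoI C^CATGG, BamHI G^GATCC, KpnI GGTAC^C.
ENZYMES = {
    "NcoI": ("CCATGG", 1),
    "BamHI": ("GGATCC", 1),
    "KpnI": ("GGTACC", 5),
}


def cut_positions(seq, enzyme):
    """0-based positions at which the top strand is cut by `enzyme`."""
    site, offset = ENZYMES[enzyme]
    upper = seq.upper()
    positions = []
    start = upper.find(site)
    while start != -1:
        positions.append(start + offset)
        start = upper.find(site, start + 1)
    return positions


def digest(seq, enzymes):
    """Top-strand fragments of a LINEAR molecule cut with all `enzymes`."""
    cuts = sorted({pos for name in enzymes for pos in cut_positions(seq, name)})
    bounds = [0] + cuts + [len(seq)]
    return [seq[a:b] for a, b in zip(bounds, bounds[1:])]


# Toy sequence invented for the example: one NcoI site and one BamHI site.
toy = "AAAACCATGGTTTTTTGGATCCAAAA"
for fragment in digest(toy, ["NcoI", "BamHI"]):
    print(len(fragment), fragment)
```

The same helper reports an empty list from `cut_positions(toy, "KpnI")`, which is how a
construct could be checked for the absence of an internal site before excising a cassette with
that enzyme.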



EXAMPLE 3: Protox-1 promoter/altered Protox-1 fusion



Using the procedure described above, an Arabidopsis Protox-1 cDNA containing a TAC
to ATG (Tyrosine to Methionine) change at nucleotides 1306-1308 in the Protox-1 sequence
(SEQ ID No. 1) was fused to the native Protox-1 promoter fragment and transformed into
Arabidopsis thaliana. This altered Protox-1 enzyme (AraC-2Met) has been shown to be >10-fold
more tolerant to various protox-inhibiting herbicides than the naturally occurring enzyme when
tested in a bacterial expression system (see Example 5 of copending U.S. application entitled
"DNA Molecules Encoding Plant Protoporphyrinogen Oxidase and Inhibitor Resistant Mutants
Thereof" filed on the same day as the instant application). Seed from the vacuum infiltrated plants
was collected and plated on a range (10.0 nM-1.0 µM) of a protox inhibitory aryluracil herbicide of
formula XVII. Multiple experiments with wild type Arabidopsis have shown that a 10.0 nM
concentration of this compound is sufficient to prevent normal seedling germination. Transgenic
seeds expressing the AraC-2Met altered enzyme fused to the native Protox-1 promoter produced
normal Arabidopsis seedlings at herbicide concentrations up to 500 nM, indicating at least 50-fold
higher herbicide tolerance when compared to wild-type Arabidopsis. This promoter/altered
protox enzyme fusion therefore functions as an effective selectable marker for plant
transformation. Several of the plants that germinated on 100.0 nM of protox-inhibiting herbicide
were transplanted to soil, grown 2-3 weeks, and tested in a spray assay with various
concentrations of the protox-inhibiting herbicide. When compared to empty vector control
transformants, the AraPT1Pro/AraC-2Met transgenics were >10-fold more tolerant to the
herbicide spray.
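
By way of illustration only, the TAC to ATG change at nucleotides 1306-1308 can be expressed
as a checked codon substitution at 1-based coordinates. The following Python sketch uses a
hypothetical helper name and a placeholder sequence of filler bases; it does not contain SEQ ID
No. 1 and makes no claim about the reading frame of that sequence.

```python
def substitute_codon(sequence, start_1based, expected, replacement):
    """Replace the three bases beginning at `start_1based` (1-based).

    The existing bases must equal `expected`, which guards against applying
    the change at the wrong coordinate.
    """
    index = start_1based - 1
    current = sequence[index:index + 3].upper()
    if current != expected.upper():
        raise ValueError(f"expected {expected} at {start_1based}, found {current!r}")
    return sequence[:index] + replacement + sequence[index + 3:]


# Placeholder standing in for the Protox-1 sequence: 1305 filler bases so
# that 'TAC' occupies nucleotides 1306-1308, followed by a short tail.
placeholder = "G" * 1305 + "TAC" + "GGGCCC"

altered = substitute_codon(placeholder, 1306, "TAC", "ATG")
print(placeholder[1305:1308], "->", altered[1305:1308])
```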
EXAMPLE 4: Isolation of the maize Protox-1 promoter sequence

A Zea mays (Missouri 17 inbred, etiolated seedlings) genomic DNA library in the Lambda
FIX II vector was purchased from Stratagene. Approximately 230,000 pfu of the library was
plated at a density of 50,000 phage per 15 cm plate and duplicate lifts were made onto
Colony/Plaque screen membranes (NEN Dupont). The plaque lifts were probed with the maize
Protox-1 cDNA labeled with 32P-dCTP by the random priming method (Life Technologies).
Hybridization and wash conditions were at 65° C as described in Church and Gilbert, Proc. Natl.
Acad. Sci. USA 81: 1991-1995 (1984). Lambda phage DNA was isolated from three positively
hybridizing phage using the Wizard Lambda Preps DNA Purification System (Promega).
Analysis by restriction digest, hybridization patterns, and DNA sequence analysis identified a
lambda clone containing approximately 3.5 kb of maize genomic DNA located 5' to the maize
Protox-1 coding sequence previously isolated as a cDNA clone. This fragment is contemplated to
include the maize Protox-1 promoter. The sequence of this fragment is set forth in SEQ ID NO.
14. From nucleotide 1 to 3532, this sequence is comprised of 5' noncoding sequence. From
nucleotide 3533 to 3848, this sequence encodes the 5' end of the maize Protox-1 protein.
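
By way of illustration only, the nucleotide ranges recited above are 1-based and inclusive, which
differs from the 0-based, end-exclusive slicing used by most programming languages. The
following Python sketch shows the conversion on a placeholder string of the stated total length;
the helper name is hypothetical and the placeholder does not contain SEQ ID NO. 14.

```python
def slice_1based(sequence, first, last):
    """Return sequence[first..last] for 1-based, inclusive coordinates."""
    if not 1 <= first <= last <= len(sequence):
        raise ValueError("coordinates out of range")
    return sequence[first - 1:last]


# Placeholder of the stated total length (3848 nt): lowercase 'n' marks the
# 5' noncoding part and lowercase 'c' the coding part. Not real sequence.
placeholder = "n" * 3532 + "c" * 316

noncoding = slice_1based(placeholder, 1, 3532)
coding_5prime = slice_1based(placeholder, 3533, 3848)
print(len(noncoding), len(coding_5prime))
```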

A plasmid containing the sequence of SEQ ID NO. 14 fused to the remainder of the maize
Protox-1 coding sequence was deposited March 19, 1996 as pWDC-14 (NRRL #B-21546).

EXAMPLE 5: Construction of Plant Transformation Vectors

Numerous transformation vectors are available for plant transformation, and the
promoters and chimeric genes of this invention can be used in conjunction with any such vectors.
The selection of vector for use will depend upon the preferred transformation technique and the
target species for transformation. For certain target species, different antibiotic or herbicide
selection markers may be preferred. Selection markers used routinely in transformation include
the nptII gene which confers resistance to kanamycin and related antibiotics (Messing & Vierra,
Gene 19: 259-268 (1982); Bevan et al., Nature 304: 184-187 (1983)), the bar gene which confers
resistance to the herbicide phosphinothricin (White et al., Nucl Acids Res 18: 1062 (1990),
Spencer et al. Theor Appl Genet 79: 625-631 (1990)), the hph gene which confers resistance to
the antibiotic hygromycin (Blochinger & Diggelmann, Mol Cell Biol 4: 2929-2931), and the dhfr
gene, which confers resistance to methotrexate (Bourouis et al., EMBO J. 2(7): 1099-1104
(1983)).

(1) Construction of Vectors Suitable for Agrobacterium Transformation
Many vectors are available for transformation using Agrobacterium tumefaciens. These typically
carry at least one T-DNA border sequence and include vectors such as pBIN19 (Bevan, Nucl.
Acids Res. (1984)) and pXYZ. Below the construction of two typical vectors is described.

Construction of pCIB200 and pCIB2001

The binary vectors pCIB200 and pCIB2001 are used for the construction of recombinant
vectors for use with Agrobacterium and were constructed in the following manner. pTJS75kan
was created by NarI digestion of pTJS75 (Schmidhauser & Helinski, J Bacteriol. 164: 446-455
(1985)) allowing excision of the tetracycline-resistance gene, followed by insertion of an AccI
fragment from pUC4K carrying an NPTII (Messing & Vierra, Gene 19: 259-268 (1982); Bevan et
al., Nature 304: 184-187 (1983); McBride et al., Plant Molecular Biology 14: 266-276 (1990)).
XhoI linkers were ligated to the EcoRV fragment of pCIB7 which contains the left and right T-
DNA borders, a plant selectable nos/nptII chimeric gene and the pUC polylinker (Rothstein et al.,
Gene 53: 153-161 (1987)), and the XhoI-digested fragment was cloned into SalI-digested
pTJS75kan to create pCIB200 (see also EP 0 332 104, example 19 [1338]). pCIB200 contains
the following unique polylinker restriction sites: EcoRI, SstI, KpnI, BglII, XbaI, and SalI.
pCIB2001 is a derivative of pCIB200 which was created by the insertion into the polylinker of
additional restriction sites. Unique restriction sites in the polylinker of pCIB2001 are EcoRI, SstI,
KpnI, BglII, XbaI, SalI, MluI, BclI, AvrII, ApaI, HpaI, and StuI. pCIB2001, in addition to
containing these unique restriction sites, also has plant and bacterial kanamycin selection, left and
right T-DNA borders for Agrobacterium-mediated transformation, the RK2-derived trfA function
for mobilization between E. coli and other hosts, and the OriT and OriV functions also from RK2.
The pCIB2001 polylinker is suitable for the cloning of plant expression cassettes containing their
own regulatory signals.



Construction of pCIB10 and Hygromycin Selection Derivatives thereof

The binary vector pCIB10 contains a gene encoding kanamycin resistance for selection in
plants, T-DNA right and left border sequences and incorporates sequences from the wide host-
range plasmid pRK252 allowing it to replicate in both E. coli and Agrobacterium. Its
construction is described by Rothstein et al., Gene 53: 153-161 (1987). Various derivatives of
pCIB10 have been constructed which incorporate the gene for hygromycin B phosphotransferase
described by Gritz et al., Gene 25: 179-188 (1983). These derivatives enable selection of
transgenic plant cells on hygromycin only (pCIB743), or hygromycin and kanamycin (pCIB715,
pCIB717).

(2) Construction of Vectors Suitable for non-Agrobacterium Transformation

Transformation without the use of Agrobacterium tumefaciens circumvents the
requirement for T-DNA sequences in the chosen transformation vector and consequently vectors
lacking these sequences can be utilized in addition to vectors such as the ones described above
which contain T-DNA sequences. Transformation techniques which do not rely on
Agrobacterium include transformation via particle bombardment, protoplast uptake (e.g. PEG and
electroporation) and microinjection. The choice of vector depends largely on the preferred
selection for the species being transformed. Below, the construction of some typical vectors is
described.



Construction of pCIB3064

pCIB3064 is a pUC-derived vector suitable for direct gene transfer techniques in
combination with selection by the herbicide basta (or phosphinothricin). The plasmid pCIB246
comprises the CaMV 35S promoter in operational fusion to the E. coli GUS gene and the CaMV
35S transcriptional terminator and is described in the PCT published application WO 93/07278.
The 35S promoter of this vector contains two ATG sequences 5' of the start site. These sites
were mutated using standard PCR techniques in such a way as to remove the ATGs and generate
the restriction sites SspI and PvuII. The new restriction sites were 96 and 37 bp away from the
unique SalI site and 101 and 42 bp away from the actual start site. The resultant derivative of
pCIB246 was designated pCIB3025. The GUS gene was then excised from pCIB3025 by
digestion with SalI and SacI, the termini rendered blunt and religated to generate plasmid
pCIB3060. The plasmid pJIT82 was obtained from the John Innes Centre, Norwich and the 400
bp SmaI fragment containing the bar gene from Streptomyces viridochromogenes was excised
and inserted into the HpaI site of pCIB3060 (Thompson et al. EMBO J 6: 2519-2523 (1987)).
This generated pCIB3064 which comprises the bar gene under the control of the CaMV 35S
promoter and terminator for herbicide selection, a gene for ampicillin resistance (for selection in
E. coli) and a polylinker with the unique sites SphI, PstI, HindIII, and BamHI. This vector is
suitable for the cloning of plant expression cassettes containing their own regulatory signals.



pSOG35 is a transformation vector which utilizes the E. coli gene dihydrofolate reductase
(DHFR) as a selectable marker conferring resistance to methotrexate. PCR was used to amplify
the 35S promoter (~800 bp), intron 6 from the maize Adh1 gene (~550 bp) and 18 bp of the GUS
untranslated leader sequence from pSOG10. A 250 bp fragment encoding the E. coli
dihydrofolate reductase type II gene was also amplified by PCR and these two PCR fragments
were assembled with a SacI-PstI fragment from pBI221 (Clontech) which comprised the pUC19
vector backbone and the nopaline synthase terminator. Assembly of these fragments generated
pSOG19 which contains the 35S promoter in fusion with the intron 6 sequence, the GUS leader,
the DHFR gene and the nopaline synthase terminator. Replacement of the GUS leader in
pSOG19 with the leader sequence from Maize Chlorotic Mottle Virus generated the
vector pSOG35. pSOG19 and pSOG35 carry the pUC gene for ampicillin resistance and have
HindIII, SphI, PstI and EcoRI sites available for the cloning of foreign sequences such as chimeric
gene sequences containing a plant protox promoter.



EXAMPLE 6:

Coding sequences intended for expression in transgenic plants under the control of a plant
protox promoter may be assembled in expression cassettes behind a suitable protox promoter and
upstream of a suitable transcription terminator. The resulting chimeric genes can then be easily
transferred to the plant transformation vectors described above in Example 5.






Protox Promoter Selection

In accordance with the present invention, the chimeric gene will contain a plant protox
promoter. The selection of the specific protox promoter used in the chimeric gene is primarily up
to the individual researcher, although generally it will be preferable to use a protox promoter from
a plant species closely related to, or most preferably identical to, the species intended to contain
the resulting chimeric gene. For example, if the chimeric gene is intended to be contained in a
maize plant it would be preferable to use a protox promoter from a monocotyledonous plant and
most preferable to use a maize protox promoter.

Transcriptional Terminators

A variety of transcriptional terminators are available for use in expression cassettes. These
are responsible for the termination of transcription beyond the transgene and its correct
polyadenylation. Appropriate transcriptional terminators are those which are known to function
in plants and include the CaMV 35S terminator, the tml terminator, the nopaline synthase
terminator, the pea rbcS E9 terminator, as well as terminators naturally associated with the plant
protox gene (i.e. "protox terminators"). These can be used in both monocotyledons and
dicotyledons.

Numerous sequences have been found to enhance gene expression from within the
transcriptional unit and these sequences can be used in conjunction with the genes of this
invention to increase their expression in transgenic plants.

Various intron sequences have been shown to enhance expression, particularly in
monocotyledonous cells. For example, the introns of the maize Adh1 gene have been found to
significantly enhance the expression of the wild-type gene under its cognate promoter when
introduced into maize cells. Intron 1 was found to be particularly effective and enhanced
expression in fusion constructs with the chloramphenicol acetyltransferase gene (Callis et al.,
Genes Develop. 1: 1183-1200 (1987)). In the same experimental system, the intron from the
maize bronze1 gene had a similar effect in enhancing expression (Callis et al., supra). Intron
sequences have been routinely incorporated into plant transformation vectors, typically within the
non-translated leader.

A number of non-translated leader sequences derived from viruses are also known to
enhance expression, and these are particularly effective in dicotyledonous cells. Specifically,
leader sequences from Tobacco Mosaic Virus (TMV, the "Ω-sequence"), Maize Chlorotic Mottle
Virus (MCMV), and Alfalfa Mosaic Virus (AMV) have been shown to be effective in enhancing
expression (e.g. Gallie et al. Nucl. Acids Res. 15: 8693-8711 (1987); Skuzeski et al. Plant Molec.
Biol. 15: 65-79 (1990)).

Various mechanisms for targeting gene products are known to exist in plants and the
sequences controlling the functioning of these mechanisms have been characterized in some detail.
For example, the targeting of gene products to the chloroplast is controlled by a signal sequence
found at the amino terminal end of various proteins and which is cleaved during chloroplast
import yielding the mature protein (e.g. Comai et al. J. Biol. Chem. 263: 15104-15109 (1988)).
These signal sequences can be fused to heterologous gene products to effect the import of
heterologous products into the chloroplast (van den Broeck et al., Nature 313: 358-363 (1985)).
DNA encoding for appropriate signal sequences can be isolated from the 5' end of the cDNAs
encoding the RUBISCO protein, the CAB protein, the EPSP synthase enzyme, the GS2 protein
and many other proteins which are known to be chloroplast localized.

Other gene products are localized to other organelles such as the mitochondrion and the
peroxisome (e.g. Unger et al. Plant Molec. Biol. 13: 411-418 (1989)). The cDNAs encoding
these products can also be manipulated to effect the targeting of heterologous gene products to
these organelles. Examples of such sequences are the nuclear-encoded ATPases and specific
aspartate amino transferase isoforms for mitochondria. Targeting to cellular protein bodies has
been described by Rogers et al., Proc. Natl. Acad. Sci. USA 82: 6512-6516 (1985).

In addition, sequences have been characterized which cause the targeting of gene products
to other cell compartments. Amino terminal sequences are responsible for targeting to the ER,
the apoplast, and extracellular secretion from aleurone cells (Koehler & Ho, Plant Cell 2: 769-
783 (1990)). Additionally, amino terminal sequences in conjunction with carboxy terminal
sequences are responsible for vacuolar targeting of gene products (Shinshi et al. Plant Molec.
Biol. 14: 357-368 (1990)).

By the fusion of the appropriate targeting sequences described above to transgene
sequences of interest it is possible to direct the transgene product to any organelle or cell
compartment. For chloroplast targeting, for example, the chloroplast signal sequence from the
RUBISCO gene, the CAB gene, the EPSP synthase gene, or the GS2 gene is fused in frame to the
amino terminal ATG of the transgene. The signal sequence selected should include the known
cleavage site and the fusion constructed should take into account any amino acids after the
cleavage site which are required for cleavage. In some cases this requirement may be fulfilled by
the addition of a small number of amino acids between the cleavage site and the transgene ATG
or alternatively replacement of some amino acids within the transgene sequence. Fusions
constructed for chloroplast import can be tested for efficacy of chloroplast uptake by in vitro
translation of in vitro transcribed constructions followed by in vitro chloroplast uptake using
techniques described by (Bartlett et al. In: Edelmann et al. (Eds.) Methods in Chloroplast
Molecular Biology, Elsevier, pp 1081-1091 (1982); Wasmann et al. Mol. Gen. Genet. 205: 446-
453 (1986)). These construction techniques are well known in the art and are equally applicable
to mitochondria and peroxisomes. The choice of targeting which may be required for expression
of the transgenes will depend on the cellular localization of the precursor required as the starting
point for a given pathway. This will usually be cytosolic or chloroplastic, although it may in some
cases be mitochondrial or peroxisomal. The products of transgene expression will not normally
require targeting to the ER, the apoplast or the vacuole.
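
The requirement that a signal sequence be fused in frame to the amino terminal ATG of the transgene can be checked computationally before a construct is made. The following is a minimal sketch under stated assumptions: the function name fuse_in_frame and the short example sequences are hypothetical and do not correspond to any sequence of this disclosure, and the check covers reading frame and premature stop codons only, not the cleavage-site requirements discussed above.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def fuse_in_frame(signal_cds: str, transgene_cds: str) -> str:
    """Join a signal-sequence coding region to a transgene so both share one frame.

    Raises ValueError if the signal sequence would shift the transgene out of
    frame or would terminate translation before the transgene ATG is reached.
    """
    signal = signal_cds.upper()
    gene = transgene_cds.upper()
    if not signal.startswith("ATG") or not gene.startswith("ATG"):
        raise ValueError("both coding regions must begin with ATG")
    if len(signal) % 3 != 0:
        raise ValueError("signal sequence length must be a multiple of 3")
    if len(gene) % 3 != 0:
        raise ValueError("transgene length must be a multiple of 3")
    # Split the signal sequence into codons and reject any premature stop.
    codons = [signal[i:i + 3] for i in range(0, len(signal), 3)]
    if any(codon in STOP_CODONS for codon in codons):
        raise ValueError("signal sequence contains an in-frame stop codon")
    return signal + gene


if __name__ == "__main__":
    # Hypothetical toy sequences, for illustration only.
    print(fuse_in_frame("ATGGCTTCCTCT", "ATGAAATAA"))
```

A fusion that passes this check would still need to be tested for uptake in vitro as described above, since reading frame alone does not establish that the signal sequence is cleaved or functional.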
The above described mechanisms for cellular targeting can be utilized in conjunction with
plant protox promoters so as to effect a specific cell targeting goal under the transcriptional
regulation of a promoter which has an expression pattern different to that of the promoter from
which the targeting signal derives.

EXAMPLE 7: Transformation of Dicotyledons

Transformation techniques for dicotyledons are well known in the art and include
Agrobacterium-based techniques and techniques which do not require Agrobacterium. Non-
Agrobacterium techniques involve the uptake of exogenous genetic material directly by
protoplasts or cells. This can be accomplished by PEG or electroporation mediated uptake,
particle bombardment-mediated delivery, or microinjection. Examples of these techniques are
described by Paszkowski et al., EMBO J 3: 2717-2722 (1984), Potrykus et al., Mol. Gen. Genet.
199: 169-177 (1985), Reich et al., Biotechnology 4: 1001-1004 (1986), and Klein et al., Nature
327: 70-73 (1987). In each case the transformed cells are regenerated to whole plants using
standard techniques known in the art.

Agrobacterium-mediated transformation is a preferred technique for transformation of
dicotyledons because of its high efficiency of transformation and its broad utility with many
different species. The many crop species which are routinely transformable by Agrobacterium
include tobacco, tomato, sunflower, cotton, oilseed rape, potato, soybean, alfalfa and poplar (EP
0 317 511 (cotton), EP 0 249 432 (tomato, to Calgene), WO 87/07299 (Brassica, to Calgene),
US 4,795,855 (poplar)).

Transformation of the target plant species by recombinant Agrobacterium usually involves
co-cultivation of the Agrobacterium with explants from the plant and follows protocols well
known in the art. Transformed tissue is regenerated on selectable medium carrying the antibiotic
or herbicide resistance marker present between the binary plasmid T-DNA borders.

EXAMPLE 8: Transformation of Monocotyledons

Transformation of most monocotyledon species has now also become routine. Preferred
techniques include direct gene transfer into protoplasts using PEG or electroporation techniques,
and particle bombardment into callus tissue. Transformations can be undertaken with a single
DNA species or multiple DNA species (i.e. co-transformation) and both these techniques are
suitable for use with this invention. Co-transformation may have the advantage of avoiding
complex vector construction and of generating transgenic plants with unlinked loci for the gene of
interest and the selectable marker, enabling the removal of the selectable marker in subsequent
generations, should this be regarded desirable. However, a disadvantage of the use of co-
transformation is the less than 100% frequency with which separate DNA species are integrated
into the genome (Schocher et al. Biotechnology 4: 1093-1096 (1986)).

Patent Applications EP 0 292 435 (to Ciba-Geigy), EP 0 392 225 (to Ciba-Geigy), WO
93/07278 (to Ciba-Geigy) and U.S. Patent No. 5,350,689 (to Ciba-Geigy) describe techniques for
the preparation of callus and protoplasts from an elite inbred line of maize, transformation of
protoplasts using PEG or electroporation, and the regeneration of maize plants from transformed
protoplasts. Gordon-Kamm et al., Plant Cell 2: 603-618 (1990) and Fromm et al.,
Biotechnology 8: 833-839 (1990) have published techniques for transformation of an A188-derived
maize line using particle bombardment. Furthermore, application WO 93/07278 (to Ciba-Geigy)
and Koziel et al., Biotechnology 11: 194-200 (1993) describe techniques for the transformation
of elite inbred lines of maize by particle bombardment. This technique utilizes immature maize
embryos of 1.5-2.5 mm length excised from a maize ear 14-15 days after pollination and a PDS-
1000He Biolistics device for bombardment.

Transformation of rice can also be undertaken by direct gene transfer techniques utilizing
protoplasts or particle bombardment. Protoplast-mediated transformation has been described for
Japonica-types and Indica-types (Zhang et al., Plant Cell Rep 7: 379-384 (1988); Shimamoto et
al. Nature 338: 274-277 (1989); Datta et al. Biotechnology 8: 736-740 (1990)). Both types are
also routinely transformable using particle bombardment (Christou et al. Biotechnology 9: 957-
962 (1991)).

Patent Application EP 0 332 581 (to Ciba-Geigy) describes techniques for the
transformation and regeneration of Pooideae protoplasts. These techniques allow the
transformation of Dactylis and wheat. Furthermore, wheat transformation has been described by
Vasil et al., Biotechnology 10: 667-674 (1992) using particle bombardment into cells of type C
long-term regenerable callus, and also by Vasil et al., Biotechnology 11: 1553-1558 (1993) and
Weeks et al., Plant Physiol. 102: 1077-1084 (1993) using particle bombardment of immature
embryos and immature embryo-derived callus. A preferred technique for wheat transformation,
however, involves the transformation of wheat by particle bombardment of immature embryos and
includes either a high sucrose or a high maltose step prior to gene delivery. Prior to
bombardment, any number of embryos (0.75-1 mm in length) are plated onto MS medium with
3% sucrose (Murashige & Skoog, Physiologia Plantarum 15: 473-497 (1962)) and 3 mg/l 2,4-D
for induction of somatic embryos which is allowed to proceed in the dark. On the chosen day of
bombardment, embryos are removed from the induction medium and placed onto the osmoticum
(i.e. induction medium with sucrose or maltose added at the desired concentration, typically
15%). The embryos are allowed to plasmolyze for 2-3 h and are then bombarded. Twenty
embryos per target plate is typical, although not critical. An appropriate gene-carrying plasmid
(such as pCIB3064 or pSOG35) is precipitated onto micrometer size gold particles using standard
procedures. Each plate of embryos is shot with the DuPont Biolistics helium device using a burst
pressure of ~1000 psi using a standard 80 mesh screen. After bombardment, the embryos are
placed back into the dark to recover for about 24 h (still on osmoticum). After 24 hrs, the
embryos are removed from the osmoticum and placed back onto induction medium where they
stay for about a month before regeneration. Approximately one month later the embryo explants
with developing embryogenic callus are transferred to regeneration medium (MS + 1 mg/liter
NAA, 5 mg/liter GA), further containing the appropriate selection agent (10 mg/l basta in the case
of pCIB3064 and 2 mg/l methotrexate in the case of pSOG35). After approximately one month,
developed shoots are transferred to larger sterile containers known as "GA7s" which contained
half strength MS, 2% sucrose, and the same concentration of selection agent. Patent application
08/147,161 describes methods for wheat transformation and is hereby incorporated by reference.
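
The concentrations given above can be converted to weighed amounts for a chosen batch volume. The following is a minimal sketch which assumes that the percentages are weight per volume (grams per 100 ml), an assumption not stated in the text; the function names are hypothetical.

```python
def grams_for_percent_wv(percent: float, volume_ml: float) -> float:
    """Grams of solute for a percent (w/v) solution, i.e. grams per 100 ml."""
    return percent * volume_ml / 100.0


def milligrams_for_mg_per_l(mg_per_l: float, volume_ml: float) -> float:
    """Milligrams of additive for a concentration stated in mg per liter."""
    return mg_per_l * volume_ml / 1000.0


if __name__ == "__main__":
    volume_ml = 1000.0  # one liter of medium, chosen only for illustration
    # Induction medium: 3% sucrose and 3 mg/l 2,4-D.
    print("sucrose (g):", grams_for_percent_wv(3, volume_ml))
    print("2,4-D (mg):", milligrams_for_mg_per_l(3, volume_ml))
    # Osmoticum: sucrose or maltose at typically 15%.
    print("osmoticum sugar (g):", grams_for_percent_wv(15, volume_ml))
    # Regeneration medium selection: 10 mg/l basta or 2 mg/l methotrexate.
    print("basta (mg):", milligrams_for_mg_per_l(10, volume_ml))
    print("methotrexate (mg):", milligrams_for_mg_per_l(2, volume_ml))
```

Whether the 15% figure for the osmoticum denotes the total sugar concentration or an amount added on top of the 3% sucrose of the induction medium is not resolved by the text, and the sketch simply computes the stated percentage.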
While the present invention has been described with reference to specific embodiments
thereof, it will be appreciated that numerous variations, modifications, and embodiments are
possible, and accordingly, all such variations, modifications and embodiments are to be regarded
as being within the spirit and scope of the present invention.



SEQUENCE LISTING

(ii) TITLE OF INVENTION: PROMOTERS FROM PLANT PROTOPORPHYRINOGEN
OXIDASE GENES

(iii) NUMBER OF SEQUENCES: 14

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Ciba-Geigy Corporation / Patent Dept.
(B) STREET: 540 White Plains Rd.
(C) CITY: Tarrytown
(D) STATE: NY
(E) COUNTRY: USA
(F) ZIP: 10591-9005

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version 11.35

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: US TBA
(B) FILING DATE:
(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: US 08/261,199
(B) FILING DATE: 16-JUN-94

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Elmer, James Scott
(B) REGISTRATION NUMBER: 36,129
(C) REFERENCE/DOCKET NUMBER: CGC 1846/provJ

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 919-541-8614
(B) TELEFAX: 919-541-8689


(2) INFORMATION FOR SEQ ID NO:1:
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 31..1644
(D) OTHER INFORMATION: /note= "Arabidopsis protox-1 cDNA;
sequence from pWDC-2"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

ITT CCGAATTCTC TCCGATTTCC ATC SAO TTA TCT CTT CTC COT CCC 
M«c olu Leu S*r Leu Leu Aiy 
1 5 

P CAA TCC CTT CTT CCC TCC TTT TCG AAO CCC AAT CTC CQA TTA 
r Gin sar Liu L«u Pro 5*r Ph» Sar Lya pro A»n Leu Arfl Leu 
) IS 20 

? TAT AAC CCT CTT AGA CTC OCT TCT TCA CTG GCC GOT QGA CCA 
, Tyr Lya Pro Lau Arg Leu Art Cy« Ser v*l Al* Gly uiy Pi-^ 
30 35 40 

: OCX TCT TCA AAA ATC CAA CCC 00* CGA GCC ACC ACC ATC ACC 

L Cly Set Ser Lys lie Clu Gly Gly Gly Gly Thr Thr tie Thr 
45 SO 55 

r TOT CTC ATT CTC GCC OCA GOT ATT ACT OCT CTT TCC ATC OCT 

> Cye Vei II* V*l Cly Gly Cly lie Ser Cly Leu Cy* lie Ale 
60 65 70 

1 CTT OCT ACT AAC CAT CCT CAT OCT OCT CCC AAT TTA ATT CTC 
» Leu Ala Thr Lys Mia Pro Aap Ala Ala Pro Aan Leu Ila Val 

75 eo « 

; OCT AAC CAT CCT CTT CCA GCC AAC ATT ATC ACT CCT CAA C*C 
i Ala Lye Aap Arg Val Cly Cly Aan lie He Tfrr Arg Glu Glu 
) 95 100 

P 4"l T CTC TOG GAA CAA GOT CCC AAT ACT TtV CAA CcC TCT CAT 
f Phe L*» Trp Glu Clu Cly- Pra Aan Ser Ph« Gin Pro S«r AID 
110 115 120 

S CTC S.CT ATC CTG OTA GAT ACT GOT TTG AAC GAT CAT TTC CTG 

: Leu Thr Met Val Val Aap Ser Gly Leu Lya A>p Aap Lau Val 

125 130 135 

\ GAT CCT ACT GCC CCA ACC TTT OTC TT* TCC AAT CCO AAA TTO 
/ Aap Pro Thr Ala Pro Arg Phe val Leu Trp Aan Gly Lya Lau 
140 145 150 

; CTT CCA TOP. AAC CTA ACA CAC TTA CCC TTC TTT CAT TTC ATG 

> Val Pro S«r Lya Lau Thr A>p Lav pro Phe Pha Aap Leu Mac 

155 160 165



J* 



5 


f 

Ser 


ATT 
lie 

170 


Gly 


CCC 

cly 


AAC ATT ASA 
Lyv Xla Arg 

175 


OCT OCT TTT 
Ala Cly Phe 


OCT 

ciy 


OCA CTT 
Ala Lau 
190 


OK ATT CCA 
Cly Xla Arg 


582 




COG 
Pro 


TCA 
S«r 


CCT 

Pro 


CCA 

Pro 


OCT CGT GAA 
Cly Arg Clu 


CAA TCT CTC 
Clu Sar Val 


GAG 

Glu 


GAC TTT 
Glu Pha 


GTA COG CGT 
Val Arg Arg 
200 


£30 


10 


AAC 

. Aan 


CTC 
Uv 


OCT 
Gly 


CAT 
A*p 


CAC GH 'L'l't 
Clu Val Phe 
205 


GAC CGC CKJ 
Glu Arg Leu 
210 


ATT 

He 


GAA CCC 
Clu Pro 


TTT TOT TCA 

Pha Cya Sex 

21i 


678 


15 


GOT 
cly 


CTT 
val 


TAT 
Tyr 


OCT 
Ala 

220 


OCT CAT CCT 
Cly Aep Pro 


TCA AAA CTC 
frar Lye Leu 
225 


ACC 
Ser 


ATC AAA 
Mat Lya 


OCA GCC TTT 
Ala Ala Pha 
230 


724 


20 


ccc 

Gly 


AAC 

Lya 


CTP 
Val 

235 


TQC 
Trp 


AAA CTA CAC 
Ly» bail Clu 


C AAT OCT 
C A*n Cly 
24 


GGA 

ciy 


AOC ATA 
Ser Jla 
245 


ATA OCT CGT 

lie cly Cly 


77* 




ACT 

Thr 


TTT 
Phe 

250 


AAG 
Lye 


CCA 
Ala 


ATT CAC GAC 
Zl« Gin Olu 
255 


ACC AAA AAC 
Arg Lya Aan 


CCT 
Ala 


CCC AAC 
Pro Lya 
2(0 


GCA CAA CCA 
Ala Clu Arg 


122 


25 


GAC 
265 


cce 
Pro 


coc 

AT9 


CTC 
I*u 


CCA AAA CCA 
Pro Lye Pro 
270 


CAC CGC CAA 
Gin Gly Cln 


ACA 
Thr 
275 


val Cly 


TCT TTC ACQ 

Sar pha Arg 

280 


670 


30 


AAC 


OCA 
Cly 


CTT 
Lau 


CCA 

ATfl 


ATG TTG CCA 
tWc Lsu Pro 
285 


CAA OCA ATA 
Clu Ala 11 a 
290 


TCT 
Sar 


oca ACA 
Ala Arg 


TTA GOT ACC 

Uu Gly s*r 

25*5 


91 B 


35 


AAA 
Lya 


CTT 
Val 


AAC 

i*y* 


TTG 
LeU 

300 


TCT TCC AAC 
Ser Trp Lye 


CTC TCA CCT 
Leu sar Gly 
305 


ATC 
Xla 


ACT AAC 
Thr Lya 


CTC GAG AGC 
Lau Clu Sar 
310 


966 


40 


CCA 
Cly 


CCA 

Cly 


TAC 

Tyr 

315 


AAC 
Aan 


TTA ACA TAT 
Leu Thr Tyr 


GAC ACT CCA 
Clu Thr Pro 
320 


CAT 

Aap 


CCT TTA 
Gly L«u 
325 


CTT TCC CTG 
Val S«r Val 


1014 


45 


CAC 

Gin 


ACC 
sar 


AAA 

Ly* 


ACT 

Ser 


CTT CTA ATC 
Vai val Mat 


ACC CTC CCA 
Thr Val pro 


TCT 
Smt 


CAT CTT 
Him Val 


CCA ACT CGT 
Ala Sar Cly 


1062 


CTC 
Leu 


TTG 
Leu 


CCC 

Arg 


CCT 
Pro 


CTT TCT GAA 

Leu Sar Olu 


TCT GCT CCA 
Sar Ala Ala 


AAT 

355 


CCA CTC 
Ala L*u 


TCA AAA CTA 
Ser Lya Leu 

360 


1110 


SO 


TAT 
Tyr 


TAC 
Tyr 


CCA 
Pre 


CCA 
Pro 


GTT CCA CCA 
Val Ala Ala 
365 


CTA TCT ATC 
Val Ser He 
370 


TCC 
Set 


TAC CCC 
Tyr Pro 


AAA GAA GCA 
Lya Glu Ala 

37S 


115a 


55 


ATC 

11c 


CCA 

Atg 


ACA 
Thr 


Glu 
380 


TCT TTG AT* 

Cya Leu lie 


CAT GCT CAA 
Aap Gly Clu 
385 


CTA 
LCU 


AAC CCT 
Lya Gly 


TTT CCC CAA 
Pna Gly Gin 
390 


1206 


60 


TTG 
Leu 


CAT 

Mil 


CCA 
Pro 
395 


COC 
Arg 


ACC CAA CCA 
Thr Gin Cly 


GTT GAA ACA 
Val Glu Thr 
400 


TTA 
Lau 


GGA ACT 

Gly lol 


ATC TAC ACC 
Zla Tyr Sar 


1254 




TCC 
Smt 


TCA 
Sar 


CTC 
Lau 


TTT 
Phe 


CCA AAT CCC 
Pro Aan Arg 


CCA CCC CCC 
Ala Pro Pro 
2< 


CCA 
Cly 


AGA ATT 
Arg Xla 


TTC CTG TTC 
Leu Lau Leu. 


1302 







AAC TAC MT GQC GOG TCT ACM AAC ACC QU ATT CTB TCC AAG TCI GAA 
Aan Tyr lie Gly Cly Ser Thr Aen Thr Qly II* uu Str lyi Su GXu 
5 , 425 430 «15 440 

GGT GAG TTA GTG GAA OCA OTT GAC ACA GAT TTC AOS AAA ATC CTA ATT 
Gly Glu Leu V*l Glu Ala V»l up Arg A«p Leu Arg Lye Mac Lew II* 
445 450 455 

10 

AAG CCT AAT TOG ACC GAT CCA CTT AAA TTA QGA CTT ACC OTA TOG CCT 
Lye Pro Aen s«r Thr Aep Pre Lev Lya Leu Gly Vel Arc Vel Tip P*o 
460 445 410 

15 ■ CAA CCC ATT CCT CAC TTT CTA CTT GGT CAC TTT 6AT ATC CTT GAC ACG 
Gin Ale lie Pro Gin Phe Leu Vel Gly Hi* Phe Aep lie Leu A*p Thr 
47S 460 4SS 

OCTAAATCATCrcTAACGTCTTCGOaCTACGAAGGGetAWTTTeCCT 
20 Al* Lye Ser Sex Leu Thr Ser s«r Gly Tyr Glu Gly UuB»I« Gly 
490 *9S S00 

CSC AAT TAC GTC CCT GOT CTA CCC TTA GGC COG TOT CTA GAA GOC OCA 
Gly ami Tyr vel Ale Cly v*l AU Leu Gly Arg Cye VeX Glu Gly Ale 

23 ' 505 510 515 530 

TAT GAA ACC GCC ATT GAS CTC AAC AAC TTC ATC TCA COG TAC OCT TAC 
Tyr Glu TAr Ale lie Clu vel A*n am Phe n«e Ser Keg Tyr Ale Tyr 

525 510 535 

AAG TAAATGTAAA ACATTAAATC TCCCACCTTG COTGACTTTT ATTAAATaTT 

wye 



TXGACATATC CAAAAAAAAA AAAAAAAA 



(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 537 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
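
As a consistency check on the listing, the CDS feature recited for SEQ ID NO:1 (31..1644) can be compared with the length recited for SEQ ID NO:2 (537 amino acids). The following is a minimal sketch; it assumes that the CDS range includes the terminal stop codon, and the function name is hypothetical.

```python
def cds_protein_length(start: int, end: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS given 1-based inclusive coordinates."""
    n_nucleotides = end - start + 1
    if n_nucleotides % 3 != 0:
        raise ValueError("CDS length is not a whole number of codons")
    n_codons = n_nucleotides // 3
    # The stop codon, if counted in the range, does not encode an amino acid.
    return n_codons - 1 if includes_stop else n_codons


if __name__ == "__main__":
    # 1644 - 31 + 1 = 1614 nucleotides = 538 codons, one of which is the stop.
    print(cds_protein_length(31, 1644))
```

Under the stated assumption the function returns 537 for the range 31..1644, in agreement with the length recited above.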

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

Met Glu Leu Ser Leu Leu Arg Pro Thr Thr Gln Ser Leu Leu Pro Ser
1 5 10 15

Phe Ser Lys Pro Asn Leu Arg Leu Asn Val Tyr Lys Pro Leu Arg Leu
20 25 30

Arg Cys Ser Val Ala Gly Gly Pro Thr Val Gly Ser Ser Lys Ile Glu
35 40 45

Gly Gly Gly Gly Thr Thr Ile Thr Thr Asp Cys Val Ile Val Gly Gly
50 55 60

Gly Ile Ser Gly Leu Cys Ile Ala Gln Ala Leu Ala Thr Lys His Pro



Gly Am 
Pro AMI 



A/m c;y 
Lys Asn 
Cly Gl« 
Als Xlc 
Ser cly 

JOS 

Thr Pro 
Vsl pro 
Ala Ala 
Ssr lie 

no 

Cly Glu 
J 85 

Clu Thr 



ssr Pne Cln 

US 

r Lou Lys AJrp 

) 

L L«u Trp Asn 

i Pra Phe PK 
165 

t Gly Als Lsu 

100 

L Glu Clu n» 
1*5 

* ll« clu Pro 

) 

i Ser Ntc Lys 
Cly Ssr Zle 

245 

Als Pro Ly* 

260 

Thr Vsl Cly 
2'S 

Ssr Ala Arg 
He Thr Lys 

Asp Cly Lsu 

325 

Ser Hia Val 
310 

Asn Ala Lsu 

355 

Sar Tyr Pro 
Lru Lys Cly 



Leu lis 
Artr citi 
Pro S*r 

Asp Leu 

135 



Cly lie 
Val Arg 

Phs Cys 

325 

Ala Als 

230 

lis Cly 
Ala Glu 
Ser Pne 



Ala Scr 
S«r Lyi 



Val Thr Glu Ala Lya Aap I 



Asp Pro 

120 

val Lsu 
Lsu Ary 



Hst LSU Thr Met 

125 

Cly Asp Pro Thr 
140 

Pro Val Pro S«r 



Arg Pro Sox Pre 

105 

Arg Asn bau cly 

200 

Ssr Gly V*l Tyr 

Phs -/a Vsl 

235 



390 
• Us Tyr 



Cly Thr 
Arg Asp 

265 

Arg Lys 
200 

Ssr Lys 

Ssr Cly 

vsl Gin 

Gly Leu 
345 

360 ^ 
Ala II* 

Cln Uu 

ssr Ssr 



250 
Pro Are 

Cly Lsu 

Vsl Lys 

Cly Tyr 

315 

Ssr Lys 

33o 

Lsu Arg 
Tyr Pro 
Arg Thr 

Hia Pro 

395 

S*r Lsu 

410 



Cly Lys 
Pro Cly 



Trp Lys 
Ala lis 
Leu Pro 



300 

Asn Lsu 
ssr Vsl 

Pro Lsu 

Pro Val 

365 

Clu cys 
Arg Thr e 
phs Pro J 



Vsl val A*r 

Als Pre Arg 

Lys Lsu Thr 
ICO 

Zls Arg Als 

175 

Arg Clu Clu 
190 

V«l Phs Glu 



Lys Pro Cln 
210 

Lsu Pro Clu 

Trp Lys Leu 

Thr Tyr Glu 
320 

vsl net Thr 

315 

S*r Clu Ser 

350 

Als Als Vsl 
Lsu lis Asp 



Pro Gly


Aro Ila Lay 
420 


l<w uu Mi Tyr ila 
425 


Cly Gly 


Sar 

430 


Thr Asa 


Thr Cly lie 
435 


Lau £*r Lya 


S#* clu cly Clu Lau 

440 


Val Glu 
445 


Ala 


Val Asp 


Ary Mp Lav 
ISO 


Xry Lya Mat 


Lau lie Lya Pro Mn 
455 


Sar rhr 
460 


A*P 


Pro Law 


Lya L*U Cly 
465 


Val Arg Val 
470 


Tm pro Gift Ala Ila 
475 


Pro Gin 


n* 


lcu val 
460 


Cly Hi* fh« 


Aap 21c Lau 


Asp Thr Al» Lya Sar 
4»0 


Sar Lau 


Thr 


S«r Scr 
4»S 


Gly Tyr Glu 


Gly Lou Ptw 
SOD 


Lcu cly cly Am Tyr 
SOS 


Val Ala 


Cly 
SiO 


Val Ala 


Lau Gly Axg 

SIS 


Cys Val Clu 


Gly Ala Tyr Glu Thr 

sao 


JUa 11* 

525 


Glu 


Val Asn 


Am Pha M; 

530 


&«r Ajcg tyr 


Al« Tyr Lya 

S3S 








(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1731 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 70..1596
(D) OTHER INFORMATION: /note= "Arabidopsis protox-2 cDNA;
sequence from pWDC-1"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

TTTTTTACTT ATTTCCGTCA CTGCTTTCGA CTGCTCAGAG ATTTTCACTC TGAATTCTTG 

CACATAOCA ATC GCG TCT CCA CCA OTA CCA CAT CAT CAA ATT CAA GCC 
Met Ala S«r Gly Ala Val Ala Aap Hia Gin lie Clu Ala 

1 5 10 

err TCA oca aaa aca gtc cca etc cta got oca got cta act oca err 
Val Ser Gly Lys Arg Val Ala Val Val Gly Ala Gly Val Ser Gly Leu
15



12& 

4 







OCT GAT CCA 
Ala Aap Gly 


AGA 

Aro 
50 


GTA 

Val 


?5 


GGG 
Gly 


AAG 
Lya 


TTG 
Leu 

55 


AGA ACT GTT ATG 
Arg Ser Val Met 


CAA 

Gin 
60 


AAT 
Aan 


252 


5 


OCT 
Gly 


TTO ATT TOG 
Let! III) TXP 

6S 


GAT 
A*P 


GAA 

clu 


OGA 
Gly 


CCA 
Ala 


AAC 
Aan 

70 


ACC 
Thr 


ATG ACT GAS OCT 
Met Thr Glu Ala 
75 


GAG 
GlU 


CCA 

Pro 


300 


10 


GAA 

Clu 


err gbg act 
Val Gly Ser 
30 


TTA 
Leu 


CTT 
Leu 


CAT 
Aap 


GAT 
Aap 

85 


CTT 
Leu 


GOG 
Gly 


CTT CGT GAG AAA 
Leu Arg Glu Lya 
90 


CAA 

Gin 


CAA 

Gin 


348 


IS 


TTT 
Phe 


CCA ATT TCA 
Pro 11* Sar 
95 


CAC 
cln 


AM 
Ly* 


AAG 
Ly» 

100 


CGC 
At* 


TAT 

Tyr 


ATT 
Zle 


CTC CGG AAT CGT 
val Arg Aan Gly 

105 


GTA 
Val 


CCT 
Pro 


396 




Val 

110 


ATG CTA CCT 
Met L«U Pro 


ACC 

Thr 


AAT 
Aan 

us 


CCC 
Pro 


ATA 
I la 


GAG 

Clu 


CTG 
Lew 


CTC ACA AST ACT 
Val Thr Sar far 

120 


Val 


CTC 
Leu 
125 


144 


20 


TCT 
Ser 


ACC CAA TCT 

Thr cln s«r 


AAG 
130 


TTT 
Phe 


CAA 
Gltt 


ATC 
He 


TTC 
Leu 


TTG 
Leu 
13S 


GAA CCA TTT TTA 
Glu Pro Phe Leu 


TOO 
Trp 
1*0 


AAG 
Lya 


492 


25 


AAA 


AAG TCC TCA 
Lya Ser Ser 

1*5 


AAA 
Lya 


CTC 
Val 


TCA 
Ser 


GAT 
Aap 


GCA 
Ala 

150 


TCT 

Set 


OCT GAA GAA ACT 

Ala clu Glv ser 

155 


GTA 

val 


ACC 
ser 


540 


30 


GAG 

Glu 


TTC TTT CAA 

Ph« Phe Gin 
160 


CGC 
ATfl 


CAT 
Hi a 


Phe 


GGA 
Gly 
165 


CAA 

Gin 


CAC 
GlU 


GTT CTT CAC TAT 
val val Aap Tyr 

170 


CTC 
uu 


ATC 
Zle 


588 


35 


CAC 

ASP 


CCT TTT CTT 
pro Phe v«i 

1?5 


CGT 
Gly 


CCA 

5iy 


ACA 

Thr 

180 


ACT 

Ser 


OCT 

Ala 


CCC 
Ala 


CAC CCT GAT TCC 
Aap Pro Aap Ser 

185 


CTT 
Leu 


TCA 
Ser 


636 




ATG 

Mat 

190 


AAB CAT TCT 
Lye His Ser 


TTC 
Phe 


CCA 
Pro 
195 


GAT 
Aap 


CTC 
Leu 


TGG 
Trp 


AAT 

Aan 


GTA GAG AAA ACT 
Val Glu Lya Sar 

200 


Phe 


GGC 

Gly 
505 


£84 


40 


TCT 
Ser 


ATT ATA 6TC 
He :lc Val 


GOT 
Gly 
210 


GCA 
Ala 


ATC 
lie 


ACA 
Arg 


ACA 
Thr 


AAG 
Lya 
31S 


TTT OCT CCT AAA 
Phe Ala Ala Lya 


Gly 
330 


GCT 
Gly 


"732 


45 


AAA 
Lye 


ACT AGA CAC 
Sar ATC AAp 

225 


ACA 

Thr 


AAG 

Lya 


ACT 
Ser 


TCT 
ser 


Pro 
330 


GGC 

Gly 


ACA AAA AAC GCT 

Thr Lya Lya Gly 

335 


TCG 
Sar 


CGT 
ATfl 


780 


50 


COG 
Gly 


TCA 1*TC *CT 
ser Phe ser 
210 


TTT 
Phe 


AAC 

Lya 


GCG 

Gly 


GGA 

Gly 
245 


ATC 

Met 


CAC 

Gin 


ATT CTT CCT GAT 

He Leu Pro Aap 
250 


ACC 

Thr 


TTG 

Leu 


828 


55 


TCC 
Cye 


AAA ACT CTC 
Lya Ser Leu 
255 


TCA 
Ser 


CAT 
Hva 


GAT 
Aap 

260 


GAG 
Clu 


ATC 
lie 


AAT 
Aan 


TTA CAC TCC AAG 
Leu Aap Sar Lya 
365 


GTA 
val 


CTC 
Leu 


876 




TCT 
S« 
270 


TTC TCT TAC 
Lau Sar Tyr 


AAT 
AMI 


ser 

275 


OCA 
Gly 


TCA 
Ser 


ACA 
Are 


CAC 
Gin 


GAG AAC TGG TCA 
Glu Aan Trp Ser 

280 


TTA 
Leu 


TCT 
Ser 
285 


924 


60 


TCT 

Cym 


GTT TCG CAT 
Val Ser Hi a 


AAT 
Aan 

290 


GAA 

Clu 


ACC 
Thr 


CAC 

Gin 


AGA 

ATfl 


CAA 

Gin 
2»5 


AAC CCC CAT TAT 
Aan Pro Mia Tyr 


GAT 
Aap 

300 


OCT 
Ala 


972
Val


ATT 
He 


ATS 
Net 


ACO 

Thr 
305 




CCT CTC TCC AAT 
Pro Leu Cye Am 
310 


via 


AAC CAC 


ATO 


AAG 

315 


CTT 


ATC 


1020 


AAA 

Ly» 


CCA 
Cly 


CCA 
Cly 
320 


CAA 

Gin 


CCC 
Pro 


TTT CAO CTA AAC 
Phe Gin Leu Aan 

325 


TTT 


CTC CCC 


GAG 
330 


ATT 


AAT 


TAC 

Tyr 


1068 


ATO 
M«C 


CCC 
Pro 
335 


CTC 

UM 


TCC 
Bmt 


CTT 
Vel 


TTA ATC ACC ACA 

L*u He Thr Thr 
340 


TTC 


ACA AAG 

Thr Lyn 

34S 


CAC 


AAA 
Ly» 


V»l 


AAC 
Lye 


1116 


ACA 
Ary 

3S0 


CCT 
Pro 


CTT 
Leu 


CAA 
Glu 


CCC 

Gly 


TTT GGG CTA CTC 
Phe Gly vel Leu 

3SS 


ATT 
Il« 


CCA TCT 
360 


AAG 
Ly» 


of 


CAA 


AAC 

Lya 
365 


1164 


CAT 
Hi* 


OCT 
Cly 


TTC 
Phe 


AAA 


ACT 
Thr 
370 


CTA GOT ACA CTT 
Leu Cly Thr Leu 


Phe 
375 


TCA TCA 
Ser s«r 


ATS 


ATG 


TTT 

380 


CCA 


1213 


CAT 
A»P 


CCT 
Ars 


TCC 
Sar 


CCT 
Pro 
3B5 


ACT 
Ser 


CAC CTT CAT CTA 
Acs Val Hia Leu 
390 


TAT 
Tyr 


ACA ACT 
Thr Thr 


TTT 


ATT 
I la 

395 


OCT 


GGG 


1260 


S€T 




AAC 
Aan 
400 


CAC 
Gin 


CAA 

Cly 


CTA CCC AAA CCT 
Lew Ala Ly* Ala 
405 


TCC 
Smz 


ACT CAC 

Thr A*p 


CAA 
ClW 
410 


TTA 
L*u 


AAA 
Ly« 


CAA 

Gin 


130B 


CTT 
Val 


V*l 
415 


ACT 

Thr 


TCT 
ser 


CAC 
A*W 


em CAc cca era 

bcu Gin Arfl Leu 
430 


TTC 


Gly V*l 


CAA 


Gly 


CAA 


CCC 


1356 


vli 

430 


TCT 
S«r 


CTC 
Val 


AAC 
Aan 


CAT 
HXB 


TAC TAT TCC ACC 
Tyr Tyr Trp Arg 
435 


AAA 

Ly» 


CCA TTC 

Ala Phe 

440 


CCC 
Pro 


TTG 
Leu 


TAT 

Tyr 


GAC 
Aap 
445 


1404 


AOC 


ACC 
Sar 


TAT 
Tyr 


SAC 
Aap 


TCA 
SM 

450 


CTC ATC CAA CCA 
Val Mac clu Ala 


ATT 
XI* 
455 


GAC AAC 
Aap Lya 


ATC 
Hat 


GAC 

Clu 


AAT 
Aan 

460 


GAT 
Asp 


1452 


CTA 
Leu 


CCT 

pro 


SCC 

Cly 


TTC 
PKe 
4SS 


TTC 

FtM 


TAT CCA CCT AAT 
Tyr Ala Cly A*.i 
470 


CAT 
Hii 


CCA GCC 
Ar* Gly 


CCC 
Cly 


CTC 
Leu 
«75 


TCT 
Ser 


GTT 
Val 


1500 


CCC 
Cly 


AAA 

Lya 


TCA 
Ser 
480 


ATA 

lie 


CCA 
Ala 


TCA CCT TCC AAA 
S«r Cly Cy» Lya 
485 


CCA 
Ale 


OCT CAC 
Ala Aap 


CTT 
Leu 
490 


CTC 
Val 


ATC 
11« 


TCA 
Ser 


1548 


TAC 
Tyr 


CTC 
LCU 
495 


SAC 

Glu 


TCT 
ser 


TCC 


TCA AAT GAC AAC 
Scr Aan Ajp Ly« 

SCO 


AAA 

Lya 


CCA AAT 
Pro Aan 

505 


GAC 
Aap 


ACC 
scr 


TTA 
Leu 


TAACATTCTC 1

AAGCTTCGTC CCTTTTTATC ACTTACTTTG TAAACTTCTA AAATCCAACA ACCCCCCCTC 1663
CCATTAOCCA ACAACTCAGC AAAACCCAOA TTCTCATAAG GCTCACTAAT TCCACAATAA 1723
ACTATTTATC 7AAAA 1738

(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:



(A) LENGTH: 508 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

Met Ala Ser Gly Ala Val Ala Asp His Gln Ile Glu

1 5 10

Lys Arg Val Ala Val Val Gly Ala Gly Val Ser Gly
IS Tyr 



Gly 



Lys Leu Lys Ser Arg Gly Leu Asn Val Thr Val
35 40

Arg Val Gly Gly Lys Leu Arg Ser Val Met Gln

50 55 60

Asp Glu Gly Ala Asn Thr Met Thr Glu Ala Glu

70 75

Leu Leu Asp Asp Leu Gly Leu Arg Glu Lys Gln

85 90

Gln Lys Lys Arg Tyr Ile Val Arg Asn Gly Val



100 



105 



Ala val 

L*u Ala 

30 

Mm Clu 
4S 

Au Gly 
pro Glu 
Cln Pba 

pro val I 

110 

Uu Sar 

125 

) Lya Ly* 

) 

L Sar Glu 
i Xla Asp 
l S*r Mat 



Lau XI* 

Val Cly 

Pro II* 

95 

Mac Leu 
TUT cln 
Lya S*r 



190 



Val Gly Ala Ile Arg Thr Lys Phe Ala Ala Lys Gly Gly Lys

210 215 220

Asp Thr Lys Ser Ser Pro Gly Thr Lys Lys Gly Ser Arg Gly

225 230 235

Ser Phe Lys Gly Gly Met Gln Ile Leu Pro Asp Thr Leu Cys



60 Tyr 
Mil 



Asn Glu Thr Gln Arg Gln Asn Pro His Tyr Asp Ala Val



Lya Hia 
Xla Xla 

Ser Arg 

Sar Phe 
240 

Lya Sar 

255 

Lau S*r 
Val Sar 
xle n*c 



5
13



r ii 



290 


295 




305 * 1 * 


(•j-o cya Aan Val 
310 






Ph* Gin L«u Aan Pbe 

325 


l*u 




L*u Il« Thr Thr Pha 
340 




Clu Gly 


Ph« Cly v*i L*u zi* 

35S 


360 


ty * 370 


L«u Cly Thi" Lau Plia 
3M 


s«r 


385 


MP Val Hia L*u Tyr 

390 


Thr 


filrt Clu 


L*u Ala Ly* Ala Sar 
405 


T&r 




Lau Gin Axg X«u L«u 
420 


Gly 


Aan Kit 


Tyr Tyr Trp Arg Lya 
435 


Ala 
440 


Mp ser 

450 


val Met Clu Ala lit 
455 


MP 


Phe Ph* 
465 


Tyr *J.m Cly Mn till 
4-JO 


AXD 


lie Ala 


s«r cly eye Lyt Ala 

405 


Ala 


Ser Cya 


S«r Mn MP Ly* Ly« 


fro 



: Pro Lau 

335 

7 Pro lau 

) 

■ Gly M» 

3 Ary S«r 

f Arfl Aan 
400 

I Val Thr 
415 

L 8*r Val 
} 

r Ser Tyr 
i pro Gly 



480 



t±3 



(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1691 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1443

(D) OTHER INFORMATION: /note= "protox-1
cDNA (not full-length); sequence from pWDC-4; first
nucleotides removed vs. first provisional"
33 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:
I GAC TOC GTC CTC GTC GCC OCA GCC ATC ACT OGC C 

I Alp Cy* V«l val Val Gly Gly Gly Xia s«f Gly i 



;CCTCTGGCCAC6C0GOU:CCCBTC00CGACCT0« 
a Ala L*u Ma Thr Aro Hi* Gly Val Gly Aap Val I 



: CCC GCC CCC CCC GGC GGC AAC ATT ACC ACC GTC C 
» Arfl Ala Arff Pro Gly Gly Aaa Ho Thr Thr v*l r 



'I - - 



k«RtACCrCTCCCAGGAGCCTCCCAACAGCTTC< 
j sly Tyr ueu Trp Clu Gly Gly Pro Aaa Ser Pha C 



! GAT GAC TTG GTT 



f COG CAC CCA AAC GCC CCG CGT TTC CTC CTC TOG C 
9 Gly Asp Pre Aan Ala Pro Arg Ph* Val Imx Trp ( 



; CCC GTC CCA TCC AAC CCC GCC GAC CTC CCG TTC 
I Pro v.." Pro Ser Lya Pro Ala Asp Leu Pr« Ph* 



TTC CAT C 
Phe Aap 1 

110 



; CCT CCT CCA GGC CCC GAA SAG TCA CTG CAC GAC T 

> Pro Pro Pro Gly Are Glu Clu S«r v*l Glu Glu l 
110 115 l«0 

: CTC CGT GCT GAG GTC TTT GAG CGC CTC ATT GAG £ 
i htm Gly Ala Glu Val Phe Glu Ar« L*u n« Glu t 

> 150 155 



J AAG GTT TOG CGC TTG GAA GAA ACT GGA GCT ACT t 
t Lya Val Trp Aro L*u Glu Clu Thr Gly Gly S«r 1 
180 185 



»»A CCG AGS 
i Lys Pro pro Aro 
20S 



r CCC CGC CTT CCG AAG CCA AAA GGG CAG ACA GTT C 

? All Aro l«u Pro Lya Pro Lya Gly Gin Thr Val J 

210 215 220 

* GOT CTT CCC ATC CTT CCA AAT CCC ATT ACA TCC J 

■ cly Lau Ala Mac Leu Pro Aan Ala I la Thr S«r S 



Ser Ser Leu Phe Pro Asn Arg Ala Pro Asp Gly Arg Val Leu Leu Leu

355 360 365

A/.C TAC ATA CCA OCT OCT ACA AAC ACA CGA ATT CTt TCC AAG ACT GAA

Asn Tyr Ile Gly Gly Ala Thr Asn Thr Gly Ile Val Ser Lys Thr Glu

370 375 380

ACT GAG CTC CTC GAA OCA CTT GAC OCT SAC CTC CCA AAA ATG CTT ATA

Ser Glu Leu Val Glu Ala Val Asp Arg Asp Leu Arg Lys Met Leu Ile
385 390 395 400

AXT TCT ACA CCA CTC CAC CCT ^A CTC CTT COT CTT CCA GTT TCC CCA

Asn Ser Thr Ala Val Asp Pro Leu Val Leu Gly Val Arg Val Trp Pro
405 410 415

CAA CCC ATA CCT CAC TTC CTC CTA OCA CAT CTT CAT CTT CTC CAA CCC

Gln Ala Ile Pro Gln Phe Leu Val Gly His Leu Asp Leu Leu Glu Ala
420 425 430

CCA AAA CCT OCC CTC CAC CCA OiT CCC TAC GAT CCC CTC TTC CTA CCA

Ala Lys Ala Ala Leu Asp Mb Gly Gly Tyr Asp Gly Leu Phe Leu Gly

435 440 445

GCC AAC TAT CTT OCA qCa CTT CCC CTC GCC ACA TCC CTT CAC GCC CCC

Gly Asn Tyr Val Ala Gly Val Ala Leu Gly Arg Cys Val Glu Gly Ala
450 455 460

TAT CAA ACT GCC TCC CAA ATA TCT CAC TtC TTG ACC AAC TAT GCC TAC

Tyr Glu Ser Ala Ser Gln Ile Ser Asp Phe Leu Thr Lys Tyr Ala Tyr
465 470 475 480

AAC TCATCAAAGA AGTGGACCCC TACTTCTTAA TCCTTTATCT TCCATAGATC

Lys




AJkj '."C AAA CTA TCA TOG AAA CTC ACC ACC ATT ACA AAA TCA CAT CAC 768
Lys Val Lys Leu Ser Trp Lys Leu Thr Ser Ile Thr Lys Ser Asp Asp
245 250 255

AAC OCA TAT GTT TTC CAC TAT CAA ACC CCA CAA CCC CTT CTT TCC CTC 816
Lys Gly Tyr Val Leu Glu Tyr Glu Thr Pro Glu Gly Val Val Ser Val

260 265 270

CAS OCT AAA ACT GTT ATC ATG ACT ATT CCA TCA TAT CTT OCT ACC AAC 864

Gln Ala Lys Ser Val Ile Met Thr Ile Pro Ser Tyr Val Ala Ser Asn

275 280 285

ATT TTC COT CCA CTT TCA ACC GAT CCT CCA GAT OCT CTA TCA AGA TTC 912

Ile Leu Arg Pro Leu Ser Ser Asp Ala Ala Asp Ala Leu Ser Arg Phe

290 295 300

TAT TAT CCA CCC GTT OCT OCT CTA ACT GTT TCC TAT CCA AAC CAA CCA 960

Tyr Tyr Pro Pro Val Ala Ala Val Thr Val Ser Tyr Pro Lys Glu Ala
305 310 315 320

ATT ACA AAA CAA TOC TTA ATT GAT GOG t«A CTC CAS GGC TTT GCC CA3 1008

Ile Arg Lys Glu Cys Leu Ile Asp Gly Glu Leu Gln Gly Phe Gly Gln

325 330 335

TTC CAT CCA OCT ACT CAA GGA CTT CAC ACA TTA OCA ACA ATA TAC ACT 1056

Leu His Pro Arg Ser Gln Gly Val Glu Thr Leu Gly Thr Ile Tyr Ser

340 345 350

TCC TCA CTC TTT CCA AAT CCT CCT CCT CAC CCT ACC CTC TTA CTT CTA 1104

Ser Ser Leu Phe Pro Asn Arg Ala Pro Asp Gly Arg Val Leu Leu Leu



^CT O CCTCC COOTAyJtM* AAOCTTGAAT AGTATTTTTr ATTCTTATTT TGTAAATPGC
ATTTCTGTTC MTTTTCTAT CAGTAATTAG TTATATTTTA OTTCTGTACC AGATTCTTCT 

GTTCACTCCC CTTCAAAAGA aatcttattt ttcattcttt tatcagaqct GTGCTACTTA 



(2) INFORMATION FOR SEQ ID NO:6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 481 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:



Ala Asp Cys Val Val Val Gly Gly Gly Ile Ser Gly Leu Cys Thr Ala
                5                   10

Gln Ala Leu Ala Thr Arg His Gly Val Gly Asp Val Leu Val Thr Glu
            20                  25                  30

Ala Arg Ala Arg Pro Gly Gly Asn Ile Thr Thr Val Glu Arg Pro Glu
        35                  40                  45

Glu Gly Tyr Leu Trp Glu Glu Gly Pro Asn Ser Phe Gln Pro Ser Asp
    50                  55                  60

Pro Val Leu Thr Met Ala Val Asp Ser Gly Leu Lys Asp Asp Leu Val
65                  70                  75                  80

Phe Gly Asp Pro Asn Ala Pro Arg Phe Val Leu Trp Glu Gly Lys Leu
                85                  90                  95

Arg Pro Val Pro Ser Lys Pro Ala Asp Leu Pro Phe Phe Asp Leu Met
            100                 105                 110

Ser Ile Pro Gly Lys Leu Arg Ala Gly Leu Gly Ala Leu Gly Ile Arg
        115                 120                 125

Pro Pro Pro Pro Gly Arg Glu Glu Ser Val Glu Glu Phe Val Arg Arg
    130                 135                 140

Asn Leu Gly Ala Glu Val Phe Glu Arg Leu Ile Glu Pro Phe Cys Ser
145                 150                                     160

Gly Val Tyr Ala Gly Asp Pro Ser Lys Leu Ser Met Lys Ala Ala Phe
                165                 170                 175

Gly Lys Val Trp Arg Leu Glu Glu Thr Gly Gly Ser Ile Ile Gly Gly
            180                 185                 190

Thr Ile Lys Thr Ile Gln Glu Arg Ser Lys Asn Pro Lys Pro Pro Arg
        195                 200                 205

Asp Ala Arg Leu Pro Lys Pro Lys Gly Gln Thr Val Ala Ser Phe Arg
    210                 215                 220



1553 
1615 
1673 
1691
Lys Gly Tyr Val Leu Glu Tyr Glu Thr Pro Glu Gly Val
260 265 3



Ile Arg Lys Glu Cys Leu Ile Asp Gly Glu Leu Gln Gly

325 330

Leu His Pro Arg Ser Gln Gly Val Glu Thr Leu Gly Thr



Ala Ser Asn
Ser Arg Phe

Lys Glu Ala

320

Phe Gly Gln
335

Ile Tyr Ser

350



Asn Tyr Ile Gly Gly Ala Thr Asn Thr Gly Ile Val Ser Lys Thr Glu



Gln Ala Ile Pro Gln Phe Leu Val Gly His Leu Asp Leu
420 425



Val Trp Pro
415

Leu Glu Ala
430

Phe Leu Gly

Glu Gly Ala

Tyr Ala Tyr
480



(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2061 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA



(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: M..1698

(D) OTHER INFORMATION: /note= "maize protox-2 cDNA;
sequence from pWDC-3"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:
CTCTCCTACC TCCACCTCCA CCACAACAAG CAAATCCCCA 



TCCAGTTCCA AACCCTAACT 



55$ 

m 



8 



CTC ACA CAC ACC CCC CTG AAC CTA AOS GTC TTC C 
I*u Arft Qln S«r ely V«l A*n V*l Thr V»l tb§ ( 

65 70 

CCC GGA GO* AAC ATA COG ACC AAT TCC GAO CGC 
AX* Gly Gly Lyw Zl« Arc Thr Aan S«r Clu (Sly 

00 as »o 

CAA GGA CCT AAC ACC ATG ACA GAA CCT GAA TGG 
Clu Sly Al* Aar. Thr H«C Thr Clu Gly Clu Trp 
100 10$ 

ATT CAT CAT CTT GOT CTA CAA CAC AAA CAC CAC 1 
llm Asp Aap i*u Gly L«u Clo A«p Ly* Clft cln 1 



CCC TTT CTC TOG CAT 
Cly »h» V»l Tip A»t> 



CAC CCC ACT ACA CTC 
Clu Al» S*r Arg L«u 

110 



GAT CCC ATP TCC CTA ATC AAA ACC AGT GTT CTT 1 
A*p Pro Il« S«r L*u net Ly« S«r S«r V*i Leu £ 
145 150 



i AAA GCT AAC ACA ACA 
■ Alt Aan Thr Arg 

175 



•TOT GAA CXiC CAC 1
■Cy» Glu Aro Hia 1 
195 

* TtT GTA CCT GGA J 
s Pb* Val Al* Gly 1 
210 



P TTT on CAT 
r Phe v*l Aap 
205 



r OCA GSA c 
r Al* Cly 4 
215 



l CAC TCA < 
) Glu Sar 1 
230 

L ASA AAO 
i Arg Ly» 

235 



i CTA OCA G 
I L*u Ala A 

350 

; AAA ASA * 
r Lya Arg * 

ass 



r GAA CTT GGA CAT G 
i Glu Val Cly Asp J 
290 



r CTT GAT TCC AAG C 

t val Aap Set Lya 1 



I AAG TTC ACC AAA C 
i Lya Phr Thr Ly* « 

355 

3 ATC CAT TAT CTA < 

i Met Asp Tyr Lau i 



^ ATG CAG 1 
f Met Cln S 
280 

f GTC AAC C 

i val Lya I 

295 



: cor cac J 
r cly Mp I 



TAT OCT TCA 
Tyr Cly Sar 

\ OCT GAT OCA 

i Cly Aap pro 

255 



r OCX CTT CAC 
i Ala Lau Hi* 

285 



r AGT AAC CAA 

i s«r Ant Gla 

335 



r CCA TTC 1 

i Pro £*u i 
345 



5 ACT CCT 1 
L Thr Ala 1 
380 



^ CCT CTG C 
i Pro L«u £ 
390 



k ACC CTT C 
i Thr Leu C 
410 



l ATG ATC TTC CCA < 
■ Hoc net Pha pro i 

HZ 



r TTA TAT ACA 
7 Lau Tyr Thr 
430
TCT ATT CPO AAA CAA CTT CTG ACC TCT GAC CTT MA AAA CTC TTG GGC
Ser Ile Leu Lys Gln Leu Val Thr Ser Asp Leu Lys Lys Leu Leu Gly
450 455 460

OTA SAC GGC CAA CCA ACT TTT GTC AAC CAT GTA TAC WW GGA AAT GCT
Val Glu Gly Gln Pro Thr Phe Val Lys His Val Tyr Trp Gly Asn Ala
465 470 475

TTT CCT TTtJ TAT GGC CAT GAT TAT ACT TCT GTA TTG CAA OCT ATA CAA

Phe Pro Leu Tyr Gly His Asp Tyr Ser Ser Val Leu Glu Ala Ile Glu

480 485 490 495

AAG ATX3 GAG AAA AAC CTT CCA GOG TTC TTC TAC GCA GGA AAT AuC AAG
Lys Met Glu Lys Asn Leu Pro Gly Phe Phe Tyr Ala Gly Asn Ser Lys

500 505 510

GAT GGC CTT GCT CTT GGA ACT GTT ATA GCT TCA GGA AGC AAG GCT GCT
Asp Gly Leu Ala Val Gly Ser Val Ile Ala Ser Gly Ser Lys Ala Ala

515 520 525

GAC CTT GCA ATC TCA TAT CTT GAA TCT CAC ACC AAG CAT AAT AAT TCA
Asp Leu Ala Ile Ser Tyr Leu Glu Ser His Thr Lys His Asn Asn Ser

530 535 540

CAT TGAAACTCTC TGACCTATCC TCTACCACTT GTCSACAAAT TTCCCAOTT
His

545

CATCrACAGT AQAAACCQAT GCGTTCCACT TTCAGAACAT CTTCACTTCT TCAGATATTA 
ACCCTTCGTT GAACATCCAC CAGAAAGGTA GTCACATGTG TAAGTGCGAA AATGAGCTTA 
AAAACTATTA TGGCGGCCGA AATGTTCCTT TTTGTTTTCC TCACAACTGG CCTA CQ ACAC 
TTGATGTTGC AAATACATTT AAATTTGTTG AATTGTTTGA CAACACATGC GTGACCTGTA 

atatttgcct attgtcattt tagcaotact cttggccaca ttatocttta cgcctttaaa 



1605 
1665 
1925 
1985 
2045 
2061 



m 



(2) INFORMATION FOR SEQ ID NO:9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1811 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL:

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 3..1599
(D) OTHER INFORMATION: /product= "wheat protox-1 cDNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:






CCC*CGSrC>kCC<!OOCaCCCACACCGC(STCCQC< 

Cly Arg v»l Thr Gly Arg Pro Hi* Arg v&l Arg I 



CCGAQCAOC^CCACCGACAeTCCGGCGGCGCCC 
Alt Ser Ser Al« Thr Glu Thr Pro Al» Al* Pro 



CGC GTO COG CTC "ICC 
Cly V«l Ar« Uu S«r 
45 



CCCCAATCCCTCATTCTCOCSCCCCOOCATCACC* 
AU Clu Cy« V«l lie V«l Cly Ala Cly lie Sor ( 



CAGCCGCTCCCCACCaSATACGOCCTCAGCGAC( 
Gin XI* Leu Ala Thr Arg Tyr Cly v»l sex A*p I 



CCC CGC CUC CGC CCC QGC OCT AAC ATC ACC ACC < 
Al* Axg Asp Arg Pro cly Cly A*n tic Thr Thr V 
80 8S 90 

CACCCCTACCtQTCCGACCACCCACCCAACACCI 
Clu Cly Tyr L*u Tfp Glu Glu Cly Pro An smr 1 

100 105 



: A*5 SAT CAC TTC C7C 
t Ly* Asp A*» I*u V*l 



TTC COG CAC CCC AAC CCC CCC CCC TTC CTC CTC t 
Pbe Cly Asp Pro asp Ai* Pro Arg Phe v»l Leu 1 
130 135 

agcccgwccgtccaacccacccgacctccct: 
Arg Pro V«l Pro S*r Lyj Pro Cly Aep Leu Pro ( 

14S 150 1 

MtwcctCBCMCKCwsseccaerreccc 
S«r :i« Pro cly ly* Leu Arg Ale Cly Leu cly A 

t«C 1SS no 

CCACCTC^CC^CaSCCCCJ^CWCTCCGTCGACC 
Pro Pro Pro Pro ciy Arg Clu Glu S«r v*i Glu G 

;es las 

AACcrrccCTGccc^c-crrTc^ccccrcATCG 

Ajn lew cly A - * Cl^ V*: P-1e Glu Arg Leu lie c 



GOT G?A TA? OCT GCT GAT CCT TCG AAC CTT ACT 1 
Gly v*i Tyr Al* Gly Asp Pro s«r Lys Leu Ser r 
210 21S 

CCCAACCTCWACCTTCCACCACATTCCACCTJ 
Cly Vt* V»l T;p Arg L»u Clu. Clu lie Cly cly i 
:35 X3C 3 

ACC ATC AAC CCC ATT CAG CAT AAA CCC AAC AAC C 
Thr Ly» Al* :ie clr. asp iys cly lyi A*n i 



: AAC CCT CCA ' 
: Lys Al* Al# 1 

220
AAC <



V CTT COS 
; Leu Ftp i 
260 

t GCC ATG < 
I Ala Met I 
375 



245 

GCA CCA I 



lya 
caa 



GTC AAG I 
Val Lya I 
290 

GGA TAT C 
Gly Tyr * 
305 



< AAT OCC I 
> Aan Al« 1 
2 BO 

f CTT ACC * 
i ii€U Thr 5 
295 



l TCT TTC ACC 
k S«r Phs Arg 
270 

i CTG OCT ACT 
r Leu Gly S*r 

265 



CAG « 
20 Cln i 
320 



t OCT TAT C 

t Gly Tyr ( 
310 

P ATC ATC J 
I He Met 1 
325 



P ACA AAO C 
t Thr Lyt I 

300 

I OCA CTT € 
i Gly L*u t 
315 



: CCA CTT 1 
j Pro L*vi 2 
340 



k. CCA CAT C 
kUaMpI 
345 



r OCT ACT CAT 
L Ala Str Asp 

335 

: TCA AAA TTC 

i Smx Ly« Mw 

350 

I AAA GAA CCT 
> ty» Glu AO* 

3S5 



r ASA AAA C 
s Arg-Ly* C 

370 



t CAT OCC C 
i Asp Gly « 

375 



: cac got i 
i cln aiy J 

300 

t GGG ACA . 
* Cly Thr 3 
395 



TCT 7 
5er S 
400 

AAC 1 



? AAT COT C 
> A«n Arg J 
40S 



k GOG ATC € 
r Gly il« v 
425 



k GTA GCA C 
1 V«l Gly ) 
435 



; tta err ens 

L Lau I4U L«u 
41S 

: AAC ACT CAC 
r Lya Thr Glu 

oo 

K ATG TTG ATA 
I Mot L*il Xlc 
445 



: CCT ACA < 
\ pro Arg J 
4 SO 

k. GCA ATA ( 

i Ala Il« i 
465 



: CTT CCA C 
f Val Arg \ 
460 

r OAT ccc 
i Asp Arg 
475 



CTT GCT GCT 



GCA 
60 Ala 
460 



X.y* Ser 
AAC TAC 



; GGC CAA g 
J Cly Gin C 
465 



; TAC CAC C 
f Tyr Aap c 
490 



GTC GCA GGA CTT GCC TTG GGC CCA TOC ATC GAG GCT GCG
ihC GAG ACT OCC TCA CAA GYA TCT GAC TTC TTS AGC AAO TAT GCC TAC 1583
Tyr Glu Ser Ala Ser Gln Val Ser Asp Phe Leu Thr Lys Tyr Ala Tyr
515 520 525

AA6 TGA TGGAAGTAGT GCATCTCTPC ATTTTGTTOC ATATACGAGG TCAOOCTAGG 1639

Lys *

ATCOGTAAAA CATCATCACA TTCWtACTG TTTCTTTAAT TBAAAAAACA AATTTTACTG 1699

ATGCAATATC TULWl T Wl i TCTAGTTCGA GCATQTACAT COOTATOGSA TAAAGTAGAA 1759

TAAGCTATTC TCCAAAACCA OTGATTTTT T TTGAAAAAAA AAAAAAAAAA AA 1811

(2) INFORMATION FOR SEQ ID NO:10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 529 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:

Ala Thr Met Ala Thr Ala Thr Val Ala Ala Ala Ser Pro Leu Arg Gly

1 5 10 15

Arg Val Thr Gly Arg Pro His Arg Val Arg Pro Arg Cys Ala Thr Ala

20 25 30

Ser Ser Ala Thr Glu Thr Pro Ala Ala Pro Gly Val Arg Leu Ser Ala



Ala Leu Ala Thr Arg Tyr Gly Val Ser Asp Leu Leu Val Thr Glu Ala



Gly Tyr Leu Trp Glu Glu Gly Pro Asn Ser Phe Gln Pro Ser Asp Pro

100 105 110

Val Leu Thr Met Ala Val Asp Ser Gly Leu Lys Asp Asp Leu Val Phe

115 120 125

Gly Asp Pro Asn Ala Pro Arg Phe Val Leu Trp Glu Gly Lys Leu Arg

130 135 140

Pro Val Pro Ser Lys Pro Gly Asp Leu Pro Phe Phe Ser Leu Met Ser

145 150 155 160

Ile Pro Gly Lys Leu Arg Ala Gly Leu Gly Ala Leu Gly Ile Arg Pro

165 170 175

Pro Pro Pro Gly Arg Glu Glu Ser Val Glu Glu Phe Val Arg Arg Asn
Tyr
Arg 
Hl« 



I Val Ttp 

> Lye XI* 

> Arg L*u 



r Tyr Val 1 

i 

i Lya S«r 1 

i Arg Fro I 
Pro 

Lye Glu 
Pro Arg 

' LftU Phe 1 
• lie Cly C 



ieo 

i Clu 
i Gly 

lie 



Val Fh« C 
Aap Pro i 



[ Leu lie Olu Pro I 



j olu He ( 

) 

> Lye cly 1 
» Lye Gly < 



i Ala Ala 

I 

i Um cly 
i Pro Pro 



Ser Gly 

Pha Cly 

cly Thr 
240 

Arg A*P 



340 
» Val 

i Cye 

Sax 



Ala Pro 

Leu Pro Aan Ala I la Ala ser Arg Leu 
Trp Lye 

Cly tyr 

Il« 
325 

s«r u« 

Ala Ala 
Lau lie 



280 
i Thr 1 

r Clu «ir 1 

> 

: Tllr lie 1 
> Asp Ala i 



r Val Ser Tyr Pro I 



, Aap Cly < 
f Val Glu 1 



Phe Gly 
He Tyr 



3SS 
Arg Lye 

Ser Lye 

ami Gin 
Val Cln 

330 

Aap Ila 

335 

Ph« Tyr 
Ala Ila 

Gin Leu 



i He Pro 

J 

i Ser Ala 
l Tyr Val 
i ser Ala 



Aan Arg 
40S 

Ser Thr 
Ala Val 
Aap Pro 



Ala Pro Ala cly Arg val Lau Lau I 

410 i 

Aan Thr t 



i Cly Val 

} 

r cln Val 



He Cly 
Gly Gly 
Ala Leu 
sar Aap 



[ Aap Leu Arg Lya 

I 

i Leu Gly Val Arg 
460 

Hia Leu Aap Arg 
475 

Tyr Aap Gly Lau 
490 

Cly Arg Cya He 

505 

Pha Lau Ttvr Lye 



Met Lev 
44 S 

Val Trp 

Leu Ala 

Phe Leu 

Clu Cly 
510 

Tyr Ala 



Pro cln 
Ala Ala 



Ala Tyr 
Tyr Lya 






(2) INFORMATION FOR SEQ ID NO:11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1847 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 55..1683

(D) OTHER INFORMATION: /product= "soybean protox-1 cDNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:
CTTTAOCACA GTGTTGAAGA TAACGAACGA ATACTOCCAT 



TACTCTAACC AACC ATC 
Met

355 



err 
30 Vel 


TCC 
S«r 


GTC 
Vel 


rrc 

Phe 


AAC 
Aba 

3«9 


CAC 
Clu 


ATC 
Il« 


CTA 
Leu 


rrc 

Rw 


ccc 
Pre 
365 


CCC AAC 
Pro Aen 


CAA 

cm 


ACC 

thr 


uu Leu 

370 


105 


CGC 
Ara 

35 


ccc 

fro 


ICC 
S«r 


CTC 
LeU 

375 


CAT 
Hi* 


TCC 
S«r 


CCA 
Pro 


ACC 
Thr 


rcr 

Ser 
360 


tie 
Ph* 


rrc acc 

Phe Thr 


TCT 

Ser 


ccc 
pro 

3fl5 


ACT CCA 
Thr Arg 


3.53 


AAA 
Lye 


rrc 

Phe 


CCT 
Pro 
390 


CSC 
APfl 


TCT 
S«r 


CCC 
ATff 


CCT 
Pre 


AAC 

Atm 

395 


CCT 
Pro 


ATP 
lie 


CTA CCC 
Leu Arfl 


TCC 

cy« 

400 


TCC 
Ser 


ATT GCC 
Zla Ale 


201 



CAA TCC ACC CC6 TCT CCC CCC / 
Glu Sec Thr Ale S«r Pro Pro I 
40S 410 



k CAC TCC CCC CCC GTC 

j Aap Ser Ala Pre Vel 






GCC 
AU 


CTC 
Leu 


CCC 
Ala 


ACC 

Thr 


AAA 
Lye 
440 


CAC 
Hie 


CCC 
Ale 


AAT 
Aen 


ccc 
Ale 


AAC CTC CTC 
Afln VeL Vel 
445 


CTC 

vel 


ACG 
Thr 


CAC CCC 

Clu Ale 
4S0 


345 


i 


CCA 

*rg 


CAC 
Aap 


ccc 
Are 


CTC 
V»l 
455 


CCC 

Gly 


CCC 
Gly 


AAC 

A*n 


ATC 
lie 


ACC 
Thr 
460 


ACC ATC CAC 
Thr Met Glu 


ACC 


CAC 
Aep 
465 


CCA TAC 

Cly Tyr 


393 




CTC 
Leu 


TCC 

Trp 


CAA 
Clu 
470 


CAA 

Clu 


CCC 
Gly 


CCC 
Pro 


AAC 
Aen 


ACC 

Ser 
475 


TTC 
Phe 


CAC CCT TCT 
Gin Pro Ser 


CAT 

Ho 


CCA 
Pro 


ATC CTC 

Met Leu 


441 


t 


ACC 

Thr 


ATC 
Wee 
40$ 


vli 


vet 


CAC 
Aep 


ACT 
ser 


CCT 
Gly 
490 


TTA 
Leu 


AAC 
Lye 


CAT CAC CTT 
Aep Clu Leu 
495 

45 


CTT 

vel 


TTC 
Leu 


CCC CAT 
Cly Aip 


489
CCT ACC CAA OCA CTC CAA *CA TTA OCA KTT XTX T*C AOC TCA TCA CTX
Arg Ser Gln Gly Val Glu Thr Leu Gly Thr Ile Tyr Ser Ser Ser Leu
760 765 770

TTC CCC AAC CCA CCA OCR CCT CCA AGO CTT CTA CTC TIC AAT TAO ATT
Phe Pro Asn Arg Ala Pro Pro Gly Arg Val Leu Leu Leu Asn Tyr Ile
775 780 785

OCA OCX CCA ACT AAT ACT GGA AW TTA TCG AAC ACC- CAC ACT CAA CTT

Gly Gly Ala Thr Asn Thr Gly Ile Leu Ser Lys Thr Asp Ser Glu Leu
790 795 800

CTC CAA ACA OPT CAT CGA CAT TTC AOG AAA ATC CTT ATA AAC CCA AAT
Val Glu Thr Val Asp Arg Asp Leu Arg Lys Ile Leu Ile Asn Pro Asn
805 810 815

CCC CAC CAT CCA TTT CTA CTC GOG CTC ACA CTC TOO CCT CAA OCT ATT
Ala Gln Asp Pro Phe Val Val Gly Val Arg Leu Trp Pro Gln Ala Ile
820 825 830 835

CCA CAC TTC TTA CTT CSC CAT CTT SAT CTT CTA GAT CTT CCT AAA CCT
Pro Gln Phe Leu Val Gly His Leu Asp Leu Leu Asp Val Ala Lys Ala
840 845 850

TCT ATC ACA AAT ACT GGC TTT CAA COS CTC TTC CTT GOB CCT AAT TAT
Ser Ile Arg Asn Thr Gly Phe Glu Gly Leu Phe Leu Gly Gly Asn Tyr
855 860 865

CMC TCT OCT CTT OCC TTC OCX CCA TCC CTT CAC OCA GCC TAT CAC CTA
Val Ser Gly Val Ala Leu Gly Arg Cys Val Glu Gly Ala Tyr Glu Val
870 875 880

CCA GCT CAA CTA AAC CAT TTT CTC ACA AAT ACA CTC TAC AAA
Ala Ala Glu Val Asn Asp Phe Leu Thr Asn Arg Val Tyr Lys
885 890 895

TACTACCACT T T TTGTTT TT CTCCT O C AAT GGGtCATCBB ACTCTCCTCT TCCATTGAAT

TATAATAATC TCAAA CTT TC TCAAATTCCT TCCATAOCTT TTTOGCCGCT TCTATTCCTC

ATAATCTAAA ATCCTCTTTA ACTTTGAAAA aaaaa a aa a a aaaa

(2) INFORMATION FOR SEQ ID NO:12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 543 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:

Met Val Ser Val Phe Asn Glu Ile Leu Phe Pro Pro Asn Gln Thr Leu
1 5 10 15

Leu Arg Pro Ser Leu His Ser Pro Thr Ser Phe Phe Thr Ser Pro Thr
20 25 30

Arg Lys Phe Pro Arg Ser Arg Pro Asn Pro Ile Leu Arg Cys Ser Ile






35 

eltt ciu 
so 

a*p cy» 

Ma Leu 

Arg Asp 

Leu Trp 
115 

a Thr Mat 
130 

P Pro Asp 

S 

L Pro Gly 

p Gly Lyr. 

> Pro Gly 
19S 

f Aap'Glu 
210 

r Ala G-y 

5 

L trp Lya 

< Ala lie 

1 Leu Pro 
275 

x Thr Met 

290 

i Leu ."ier 
r Ser Leu 
» Thr Val 

j Pro Leu 

35S 

j Pro Val 



Ser Thr 

Val val 

Ala Thr 
65 

Arg Val 
100 

Glu Glu 
Val val 
Ala 

Lya Leu 
lfiS 

tie Arc 
180 

Hie oli» 

Val Phe 

Asp Pro 

Lex Glu 
215 

Bin Glu 
2«0 

Lye Pro 

Leu Pro 

Trp Lye 

Thr Tyr 
325 

Val Leu 
340 

Ser Ala 
fcle Ala 



40 45 

Ala Ser Pro Pro Lye Thr Arg Aap 

SS 60 

Val Gly Glv Gly Val Ser Gly Leu 

70 75 

Lye Hie Ale Aan Ale Aan Val Vol 
90 

Gly Gly Aan lie Thx Thr Met Glu 
IDS 

Gly Pro Aan Ser Phe Gin Pro S«r 

120 125 

Asp Ser Gly Leu Lya Aap Glu Leu 



135 



140 



Arg Phe Val Leu Trp Aan Arg Lya 

130 155 

Thr Aap Leu fro Phe Phe Aap Lew 
170 

Ala cly Phe Gly Ala Lev Gly He 



CZ.u Arg Leu He Glu Pro Phe eye 
215 220 

Ser Lye Leu Ser Met Lya Ala Ale 

230 235 

Lya Aan Gly Gly Ser tie I** Gly ( 
250 

Arg Aan aiy Ale Ser Ly» Pro Pro i 

265 ; 



Ser Ala Pro 

cya tie Ala 
80 

Val Thr Glu 
95 

Arg Aap Gly 

110 

Aap pro Met 

val Leu Gly 

Leu Arg Pro 
ISO 

Mec Ser lie 

175 

Arg Pre Pro 
190 

Arg Aan Leu 

Ser Gly Val 

Phe Gly tye 
240 



Asa Ala He Ser Ala Arg Leu Gly 

295 300 

Leu Ser ser lib Sw Lya Leu Aap 

310 315 

Glu Thr Pro Glu Cly Val Val Ser 
330 

Thr He Pro Ser Tyr Val Ala Ser 
345 

Ala Ala Ala Aap Ala Leu Ser Lya 

360 365 

Val Ser lie Ser Tyr Pro Lya Glu 



Arg Lya Gly 

Aan Lya Val 

Ser Gly Glu 
320 

Leu Gin Cya 

Thr Leu Leu 
3S0 

Phe Tyr Tyt 

Ala Ile Arg
370






375 


380 




Ser Glu Cys Leu Ile Asp Gly Glu Leu Lys Gly Phe Gly Gln Leu His
                                        395                 400

Pro Arg Ser Gln Gly Val Glu Thr Leu Gly Thr Ile Tyr Ser Ser Ser
                                    410                 415

Leu Phe Pro Asn Arg Ala Pro Pro Gly Arg Val Leu Leu Leu Asn Tyr
            420                 425                 430

Ile Gly Gly Ala Thr Asn Thr Gly Ile Leu Ser Lys Thr Asp Ser Glu
        435                 440                 445

Leu Val Glu Thr Val Asp Arg Asp Leu Arg Lys Ile Leu Ile Asn Pro
                                            460

Asn Ala Gln Asp Pro Phe Val Val Gly Val Arg Leu Trp Pro Gln Ala
465                 470                 475                 480

Ile Pro Gln Phe Leu Val Gly His Leu Asp Leu Leu Asp Val Ala Lys
                485                 490                 495

Ala Ser Ile Arg Asn Thr Gly Phe Glu Gly Leu Phe Leu Gly Gly Asn
            500                 505                 510

Tyr Val Ser Gly Val Ala Leu Gly Arg Cys Val Glu Gly Ala Tyr Glu
        515                 520                 525

Val Ala Ala Glu Val Asn Asp Phe Leu Thr Asn Arg Val Tyr Lys
                        535                 540




(2) INFORMATION FOR SEQ ID NO:13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 583 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(ix) FEATURE:

(A) NAME/KEY: promoter
(B) LOCATION: 1..583

(D) OTHER INFORMATION: /function= "arabidopsis protox-1
promoter"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:
CAATTCCGAT CGAATTATAT AATTATCATA AATTTCAATA ACCATCTTCC CfifTATTAA 
ACAOCTTTAA TAAACTTTCC TAATAATCCA CTTTCACTTC AAACTCGATT CTCATGTAAT 
TAATTAATAT TTACATCAAA ATTTGGTCAC TAATATTACC AAATTAATAT ACTAAAATCrt 
TAATTCCCAA ATAAAACACT AATTCCAAAT AAACGCTCAT TATCATAAAC ACGTATTGAA 
CTTCATAAAC CAAAGCAAAA ATAATQOGTT TCAAOGTTTG OCTTATATAT OACAAAAA A A
VAAAAOCTT TGCTTATATA TCtATPWCC CfATAACCAT GWAIACAAA moOOCCT* 360

ACTAAAATAA TAAAATAAAC GTAATGCJCC TTTTTATXTT 1QQGTCAAAC CCAACTCTAA 420 

ACCCAAACCA AAGAAAAAGT ATACOOTAOG OtACACMUC TTATOCTCTG TBTCATTOCA 480

CCTCAATATr TCXmCWC T KJfCCTm! TTCTGAA5AA GATTACCCAA TCT6AAAAAA 540 

ACCAAGAAQC TQACAAAATT CCGAATTCVC TDCOATTTCC ATS 583 



(2) INFORMATION FOR SEQ ID NO:14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3S48 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:






TCGAtCTfTC TACGCTfiATC CCCAAATCrr CCTOCGAAflC CCCTOGCGCC TCTGCCCCTT 60
GGAGCTCGTG GCCTCAAA6A OCTTTQCTGT TGCCCCGAAG ATTGTGAGGr AtATTCTGAC 120
CTCTGACACT CACTTCCTTT GTCCTCACTT TGAfiTOGACT TAtGCATTGA CCTBAC17HSC 180
CTCAQATOGA TTCTTCCTCC GAAGCCCCTG CTCATTTCCG ACAATCTCtA ATCWATTCC 240
CTTCTTTCGC CAAAATCTCT CAGCTTBGAT OTACTCATCC ATCTtCTGAA GCAflCWCTC 300
CACACjTTIGT GQAOQCTTCC TGGCGAAATA TTOGSCTGttk CCTCCTCCAC GAAGACCCTT 360
SATCATGOCC TCAATQACAA TCTCATTCCG CACCOTASOC GCTTGTSCCC TCAATCCCAA 420
GAACCTTCCT ACATATCCCT GAAGGTATTC TTCCTGATCT TCTCTCeATT GCAACACACC 480
CTCACCTCTO ACCGACTTCO TTTOAAAOCC TTGGAAGCTA CTAACCAACA TGTQCTTAAG 540
CTTCTCCCAC GACGTCATAC tccctcoccg aagacaagaa TAccAwrnr oogctacaot 600
CCGCACTGCC ATCACGAAGG ACTTCGCCAT GACTACAGTG TTGACCCCAT ACOAAGATAT 660
ACHOCTTCG TACCTCATCA (JAAACTBCTT TGGATCTGAC TOCCCATCAT ACATCO0GAC 720
CTCAGGTCCC 'I'IdATGATG QGC**CCATGG OCTACCCTCC AGTTCTGCTG CCAAC&JACA 780
AQCATCATCA AAACTAAAOC CATCATCATT AAAATCATCA TACCATCCAT CCTCGTTOAA 840
TAACCCTTC* TGACCAAOCr CCCTGTCTTC GOOCCfTCOA TCTTffrtCAT C1TGAACAAQ 900
ATGACGCACT VCTTCACTGG CTTCGTCSAT CM tCW rGC AGAtCACCCA GTCOCACCA? 960


CTTCTCCTTC incriiGVA cncmATC gatcatctcc ATSTCCCTMJV TCTCTTQGTC 1020
CAACTOCTCC TCTTGGAGTC 7CAGACTGUT CQCTWCCTC TTCTGCCTTC QAGCCTCTCG 1080
aGAGAAAGA OTTTCTTGAT TTOGGTQCA5 CGGCTGCAGT GCAG7TSGTCC CTOGTGCXCA 1140
WXTTTCTTC GGTCQCAT6A CAAAGGTCAC TGC7TGCCSA AGGTGGTCGA AAMJGGTTCA 1200
CTAGAGCTCG GAGCCAATCT 1GGGGACTTC TCAACTGCTA TGAGTTAAGA ACAAGGCAAC 1260
ACAAAATCTT AAATATTAAT AGCTTTCATC TTfCGAAGCA tTAtTTCCCT TTCGGTATAA 1320
TSATCTTCAO ACGAAAGAG7 CCTTCATCAT TGCGATATAT GRAATAGAA OGAOGAGCAT 1380
ATGAAATGTA ASAGACAACA TGAACAATCG TGTAGCATTC TTAATTCATC ATCATTTTAT 1440
TATTATGGAA AAATAGAAAC AATATT5AAT TACA-'ATOTA OCWWUCW GACAGAAGAT 1500
AAAAGTACAA GCTTGACOCA CGAGCAAGTA CAAGTCASTG TSAACACTAC GGOCGTACTG 1560
TTCATCTATT TATAGGCACA GGACACAGCC TCTGACAAAT TACACTCATG CCCTTTACAT 1620
TTACTATTCA CTTATAGAAA WTCTATGAG GACTGGATAG CCTCTrCCCC TTTAAGTCGG 1680
TGCCmTTC CGCGATTAAG CCGAATCTCC CTTOCGCATA GCTTCGCA GC AfCOSCAACC 1740
TTCCTCACGfc TCATGCCCTT CTCATTGTGT ATGC'I'l'l'lAA TCCTGAATTC GAAQ0TACCT 1800
CTCCATAAAC CATACTTGGA AGACATTCT? AAATTATGTT TITGAGOACC TTCGGAOGAC 1860
GAACGCCCCC AACAGTCOTO TtTPTCAGGA CCTTCQGAAG ATBAAGGCCC CCAACAACAC 1920
CTATCCATAA AACCAACCTA 7CCACAAAAC CGACCCCATT CACCCTTCAT TTGCCTCACC 1980
AACAACCCTA ATTAGCTTCT TSCTTT AAAT TPTfTAQOGT CAATTTOGTC AfCACCATCC 2040
ACTGTCACTC CACAAACTCA J-'ATCAATAA ACAGACTCAA TCACCCAAAC TGACCATACC 2100
CATAAAACCC CCCCACCCTT CTACCGCCTC GCCAGAAACC ACAAACCCTC ATTCAGAGTT 2160
CAAAC7TAAA ACGACCATA> CTTTCACCTT GGAACTCGAA tCAOGTCCAT TTTTTTCCAA 2220
ATCACACAAA ATTAAATTTC GCATCCGATA ATCAAGCCAT CTCTTCACTA T GCTT T T AAG 2280
TGTTGCTCAC ACTACTGTAT TTATCGACTA ATCACCT GTO TATCTCATAC AATAACAf AT 2340
CAGtACATCT AACTTCTTAC TCAAWACCA AAACCGAATT AWMCTTCC AAAAAGOTTA 2400
TCGACTAGTC ACTCAATTAC CAAAACTAAA CTTTACACTT TCATGTATGA CATCCAACAT 2460
GACACTCTAC TGGACTAAAC CACCTTTCAA GCTACACAAC GAGCAAAAAT AACTAAC.fT 2520
cctagttota ggagctaaac tatatctcca caacaatact taacggaagc ccccaacoac 2580
ttaaaatcc ttttaCctct Tgaaactttt ctcgtgctct actttttcac tttaaacttc 2640
aaaatttcfcc attttatcac cccttaach: ttaaaaccat ttaaattaca ttcttactag 2700
attatagatg attttgttgt qaaaagtttt taacacatgt ttacacatto attaaaatca 2760
TTTCTTCAAT TTCCTAGAGT TAAATCTAAT CTTATTAAAA CTATTAAAGA TaCTTTCaCG 2820
AGCTCTAAAT ATTTTTATTT TTTCATTATG GAAirrTCTT AGAATTCTTA TACACCTTTf 2880
rrrCTGCTTT AAAACCCTTG CCATCTTTTT AVCAACTTTT TTTTCTATTT TTTGAAATTt 2940



TTOGAAAC 
AAAfCAACTT 

J 

AAAATTAAAC 
MtCAASAAG 
10 TfGATGGOAA 
AAATATTTST 
CTAGAGCGAA 

ACCTGTAACG 
20 TCCTCTCATC 
CGCCACAGCC 
TWJOC< i ^CTC 



CACTTCTAAC 
ATQAAAT7ST 
CAACTTACGG 
CCCTAGACAT 
CATAAGACCA 
GCTOCAATTO 
CAAOCCCCAC 
GATTGGATAT 
TTCCACTSGG 
CACTCOOCCC 



CATCAGTGCC 
TGTCACGOAG 
MSaCTACC , 



CQCCATCCAC 
GCATCCACCO 
CTCTOCACC0 
GCCCGCGCCC 



GCGOTASAAG 
CTTQGAAACT 
AATCGCCCAA 
AATCTAAATO 
TAACCCTACT 
ATCCTQNCC 
GTCACCCOTC 
CAACOGAACC 
CCATCCTTAA 
CACAGOOGCT 
CCACCCCTGC 
GACTCAOCGT 
GCCCGCOCCT 
CGCAGGCGCT 



AtTTAl'l'l 1C 
ACCTCXAACC 
CATATCTCSA 
OTTTCACAAT 
TCACAGAGaT 
tCAAAITCAC 



AATCACOCAC 
CTCCAACCCC 



CTACACTTAT 
CU7TAGAATC 
TTAAAGTOOA 
1GAQBGTTAT 
AAAAGOGTTA 
CCICCAACCA 
CGAACCAGCT 
OOCAATGCOA 
AACQOCCCTA 



ATCOCCCCTA CTCAACCeCA 



ATCTACAACA 
AATTTCAATC 
TATSSATACA 
TTTTTCAACT 
TT1 flTTCAG 

AocccAOorr 

CTTCTOCACA 

TTce c Aocee 

CCCCA1CTCC 
AAATQCTCQC 
CCCBAATACC 



GTCC OC CGAC T U C fc TT tt T C S TQQGCQGASG 
GGCCACCCOG CACSOCGTCe (SOBAOGTOCT 
CAACATTACC ACCGTCCACC GCCCCGAQGA 



3000
3060
3120
3180
3240
3300
3360
3420
3480
3540
3600
3660
3720
3780

3848



The invention as described herein is contemplated to include the following enumerated

1. A recombinant DNA molecule comprising a plant protoporphyrinogen oxidase
(protox) promoter or a functionally equivalent derivative thereof.

2. A chimeric gene comprising a plant protox promoter operably linked to a
heterologous DNA coding sequence.

3. The chimeric gene of claim 2 wherein said plant protox promoter is from a protox-1
gene.

4. The chimeric gene of claim 2 wherein said plant protox promoter is from a protox-2
gene.

5. The chimeric gene of claim 2 wherein said protox promoter is from a plant selected
from the group consisting of Arabidopsis, soybean, cotton, tobacco, sugar beet, oilseed rape,
maize, wheat, sorghum, rye, oats, turf grass and rice.

6. The chimeric gene of claim 5 wherein said promoter is from a plant selected from the
group consisting of Arabidopsis and maize.

7. The chimeric gene of claim 6 wherein said promoter is at least 300 nucleotides in
length.

8. The chimeric gene of claim 7 wherein said promoter is at least 500 nucleotides in
length.

9. The chimeric gene of claim 8 wherein said promoter is from Arabidopsis and has the
sequence set forth in SEQ ID No. 13.

10. The chimeric gene of claim 8 wherein said promoter is from maize and has the
sequence set forth in SEQ ID No. 14.

11. The chimeric gene of claim 2 wherein said heterologous coding sequence encodes a
modified, herbicide-resistant form of a plant enzyme.

12. The chimeric gene of claim 11 wherein said plant enzyme is selected from the group
consisting of imidazoleglycerol phosphate dehydratase (IGPD), EPSP synthase, glutamine
synthetase (GS), acetyl coenzyme A carboxylase, acetolactate synthase, and protoporphyrinogen
oxidase (protox).

13. The chimeric gene of claim 12 wherein said plant enzyme is protox.

14. A recombinant DNA vector comprising the recombinant DNA molecule of claim 1.

15. Plant tissue comprising the chimeric gene of claim 2. 

16. A plant comprising the chimeric gene of claim 2. 

17. The plant of claim 16 wherein said plant is selected from the group consisting of
Arabidopsis, soybean, cotton, tobacco, sugar beet, oilseed rape, maize, wheat, sorghum, rye,
oats, turf grass and rice.

Promoters naturally associated with plant protoporphyrinogen oxidase (protox) coding
sequences, and derivatives thereof, are provided. These promoters can be used to control the
expression of an operably linked heterologous coding sequence in a plant cell. These promoters
are particularly useful for expressing modified forms of herbicide target enzymes, particularly
modified forms of protox, to achieve tolerance to herbicides which inhibit the corresponding
unmodified enzymes. Recombinant DNA molecules and chimeric genes comprising these
promoters are provided, as well as plant tissue and plants containing such chimeric genes.